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Abstract 



Let A be a p-variate real Wishart matrix on n degrees of freedom with identity covari- 
ance. The distribution of the largest eigenvalue in A has important applications in multivari- 
ate statist i cs. C onsider the asymptotics when p grows in proportion to n, it is known from 
Johnstone! ( 2001 ) that after centering and scaling, these distributions approach the orthogonal 
Tracy- Widom law for real-valued data, which can be numerically evaluated and tabulated in 
software. 

Under the same assumption, we show that more carefully chosen centering and scaling 
constants improve the accuracy of the distributional approximation by the Tracy- Widom limit 
to second order: 0((riAp) -2 ' 3 ). Together with the numerical simulation, it implies that 
the Tracy- Widom law is an attractive approximation to the distributions of these largest 
eigenvalues, which is important for using the asymptotic result in practice. We also provide a 
■ parallel accuracy result for the smallest eigenvalue of A when n > p. 

(N . 

£T) . Key Words and Phrases. Eigenvalues of random matrices, Laguerre orthogonal ensemble, 

Laguerre polynomial, Liouville- Green method, principal component analysis, rate of convergence, 
Tracy- Widom distribution, Wishart distribution. 
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^ ■ 1 Introduction 

> 

The central object of multivariate statistical analysis is an n x p data matrix X, where each 



5_i ■ of the n rows corresponds to an observation of a random vector in a p-dimensional space. If we 



assume that the row vectors are i.i.d. samples from a multivariate Gaussian distribution N p (fi, £), 
much of the classical theory in multivariate statistical analysis is reduced to study of the eigen- 
decomposition of a random matrix following a Wishart distribution. Typical examples include 
but are not limited to principal component analysis (PC A), factor analysis and multidimensional 
scaling (MDS). The fundamental setting is the determinantal equation 

det(A- XI) =0, 

where A follows a central Wishart distribution with covariance matrix S. 

In this setting, a common null hypothesis is Hq : £ = I. For instance, in PC A, this is the 



hypot hesis of isotropic variation over all the principal components; see, for example, iMardia et al 



(|l979l . Section 8.4.3). If Hq is true, we say that we are in the null case and call A a (real) 



white Wishart matrix. For testing this particular hypothesis, as for many others in multivariate 
statistics, there are two different systematic strategies: one is the likelihood ratio test (LRT), 
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which uses all the eigenvalues of A; the other is the union intersection test (UIT) initiated by iRov 
( 19531 ). which utilizes only the largest (or smallest) eigenvalue of A for the current problem. 

An inconvenience of using UIT is that the exact evaluation of the marginal distribution of 
the extreme sample eigenvalues is not simply tr actable, even in the null case considered here. 
Interested readers are referred to Muirhead ( 19821 . Section 9.7) for the expressions of the marginal 
distributions in terms of hyper geometric function of ma trix argument; see, in parti cular, Corollary 
9.7.2 and 9.7.4 there. We remark that recent work of Koev and Edelman ( 20061 ) has developed 
efficient evaluations of hypergeometric functions of matrix argument and made the computation 
of the exact marginal distributions possible when both n and p are small. 

An alternative approach is to approximate these exact finite sample distributions of the ex- 
treme eigenvalues by some other well-understood asymptotic distribution. This kind of approx- 
imation is ubiquitous in statistics: the normal approximation to the distribution of the Wald 
and score statistics, the Chi-square approxi mation to t h e Pea rson statistic in fitting contingency 
tables, etc. For the problem studied here, lAnderson d2003l . Chapter 13) provides a complete 
summary of the established results in the conventional regime of asymptotics: 



p is fixed and n 



oo. 



However, many modern data (microarray data, stock prices, weather forecasting, etc.) we are now 
dealing with typically have the number of features p very large while the number of observations 
n much smaller than or just comparable to p. For these situations, the classical asymptotics is 
no longer always appropriate and new asymptotic results that could handle this type of data are 
desirable. 



An advance in this direction was made in lJohnstond (|200ll ). where the asymptotic regime was 
switched to 

p — > oo, n = n(p) — ► oo and n/p — > 7 G (0, 00). (1) 

To state his result, let X be an n x p data matrix with the n rows i.i.d. following N P (Q,I). The 
pxp matrix A = X'X has a standard Wishart distribution: A ~ W P (I, n). We denote the ordered 
eigenvalues of A by Ai > • • • > A p . B orrowing tools from the field of Ran dom Matrix Theory 
(RMT), especially those established in iTracv and Widoml (jl994l . ll99fiL Il998l ). Johnstone showed 
that if we define centering and scaling constants as 



(y/n-1 + y/p) , dp = (y/n-1 + y/p) 



n 



= + 

1 Vp 



1/3 



then under condition (CD 



Ai — fj, p v 



0",i 



Wi~.Fi, 



(2) 



(3) 



where F\ is the orthogonal Tracy- Widom law, which was originally found by Tracy and Widoml 
(jl996h as the limiting law of the l argest eigenvalue of a p x p real Gaussian symmetric matrix. We 
remark that, prio r to iJohnstond (|200lh , as a byproduct of his analysis on random growth model, 
Johansson ( 2000l ) established the scaling limit for the largest eigenvalue in complex white Wishart 
matrix, which turns out to be the unitary Tracy - Widom law F2. We'd also like to mention that 
for the weak limit © to hold. IeI Karouif (|2006al ) extended the asymptotic regime ([TJ to include 
the cases where n/p — > or 00. 

This type of asymptotic result, albeit emerging only recently in the statistics literature, has al- 
rea dy found its relev a nce to applications with modern data. For instance, based on the weak limit 
Patterson et~aH d2006l ) developed a formal test for the presence of population heterogeneity 
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in a biallelic dataset and suggested a systematic way for assigning statistical signific ance to sue 



cessive eigenvectors, which in turn has b een used to correc t population stratification (jPrice et al 



20061 ) and to perform genetic matching (jLuca et all 120081 ) in genome-wide association studies 

From a statistical point of view, to inform the use of any asymptotic result in practice, we 
need to have an understanding of the accuracy of the approximation to finite distributions by the 
limit, which usual ly appears in the form of a rate of convergence result. In the complex domain, 
El Karouil (j2006bh established such a result for Johansson's theorem with carefully chosen cen- 
tering and scaling constants. With his choice, the error term in the Tracy- Widom approximation 
could be controlled at the order O f(n A p)~ 2//3 ), as opp osed to O ((rt A p)" 1 / 3 ) by using the origi- 
nal centering and scaling constants in I Johansson ( 2000 ) . For an up-to-date survey of higher order 
accuracy results of this fashion, we refer to Johnstonel ( 2006 . Section 3). 

In statistics, we are typically more interested in real-valued data. However, for technical 
reasons, results for complex-valued data are usually easier to derive under the asymptotic regime 
(TJJ) than in the real case. Rece ntly, in analyz i ng th e parallel problem for the greatest ro ot statistics 
for pairs of Wishart matrices ( Mardia et al. . 19791 . Definition 3.7.2), Johnstone ( 2007 ) figured out 
a way to connect the central object of study in the real case to that in the complex case. To 
be more specific, in both real and complex cases, the problem reduces to the study of operator 
convergence in appropriate metrics by using standard techniques from Random Matrix Theory. 
The key observation there is that the crucial element of the operator kernel in the real case could 



be rep resented in closed form as a rank o ne perturba t ion of the complex kernel; see iJohnstone 
TJ, Eq.(50)), which is a conse quence of lAdler et al.1 ((2000J, Proposition 4.2). 
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Inspired by IJohnstone (|2007l ). we investigate in this paper the rate of convergence for the 
distributions of properly rescaled largest eigenvalues in real wh ite Wishart matric es to the or- 
thogonal Tracy-Widom law. We remark that, instead of using lAdler et al.1 t0Q(\ . P roposition 
4.2), the central formula (|15p fo r the "c o mplex to real" connnection in our paper is derived from 
a slightly earlier result given in IWidoml (Il999l . Section 4) which is specific to white Wishart ma- 
trices. This new approach not only helps to avoid introducin g a further nonlinear transformation 
after rescaling the largest eigenvalues as in Johnstonel ( 2007 ) but also enables us to make direct 
use of the analysis done in lEl Karouil (|2006bT T for complex white Wishart matrices. 



Statement of the theoretical result. It was suggested in Ijohnstonel (|2006l ) that if we modify the 
centering and scaling constants from ([2]) to 



f^np 



a 



np 




(4) 



we might obtain second order accuracy in the Tracy-Widom approximation. 

Indeed, the main theoretical result of the paper can be formulated as the following theorem, 
which establishes the above conjecture. 

Theorem 1. Let A ~ W p (I,n) and Ai be its largest eigenvalue. Define centering and scaling con- 
stants {jlnp^np) as in @, then under condition (pTJ) , there exists a continuous and nonincreasing 
function C(-), such that for all real sq, there exists an integer No(so,~/) for which we have that 
for any s > so and n A p > No(so,^/), 

\P{M <pLnp + Z n ps}-Fx(s)\ < C(s )(nAp)- 2 / 3 exp(-s/2). 
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The theorem provides theoretical support for using the Tracy- Widom law F\ as approximate 
largest eigenvalue distribution in the null case. In addition, the numerical investigation pursued 
in Section 12,11 shows that the approximation yields reasonable accuracy even when n and p are as 
small as 2. Therefore, both theoretical and numerical results provide us with the confidence in 
using the Tracy- Widom approximation for nearly all finite n x p distributions, at least under the 
Wishart assumption. 

Remark 1. In fact, Theorem [J wi ll be proved onl y when p is even and n ^ p since our method 
relies on a determinant formula of lde Bruiinl (|1955l ) which was only established for p even and the 
Laguerre polynomials which are essential for building the convergence rate are not well-defined 
when n = p. It would be of interest to have some theoretical support for the p odd and the square 
cases. However, numerical experiments suggest that the Tracy- Widom approximation works just 
as well for p odd as for p even and for n = p as for n ^ p. 



Organization of the paper. In Section [2J we first investigate the numerical quality of the 
Tracy- Widom approximation for finite nx p distributions, then review some important statistical 
settings to which our result is relevant and finally discuss several interesting issues involved in this 
study, including a parallel result for the smallest eigenvalue. The rest of the paper is dedicated 
to the proof of Theorem [H mainly with tools from Random Matrix Theory. In Section (3J we 
start with the formulation of Theorem [1] in RMT terminology. After that, we derive the central 
formula (|15p in this paper and reduce our problem to the study of operator convergence in some 
appropriate metric. We sketch our proof of the main result in Section 0] by assembling operator 
theoretic tools and asymptotic bounds on transformed Laguerre polynomials. Finally, Section 
[5] gives details of the Laguerre asymptotics required in the proof. Appendix [A] collects various 
necessary technical details not spelled out fully in the main text. Appendix [B] discusses the issues 
mentioned in Section 12.31 in a more concrete manner. 



2 Statistical Implications and Discussion 
2.1 Quality of the approximation 

An important motivation for the current study is to promote practical use of the Tracy- Widom 
approximation. For example, one could tabulate the F\ table and use it to compute p- values. With 
such motivation, we investigate the quality of the approximation with numerical experiments. 



Distributional approximation. First of all, we study the numerical accuracy of the approx- 
imation using our centering and scaling constants and compare it with that of the original 
proposal ([2]) in Ijohnstond (120011 ) , with results summarized in Table CD We first look at the square 
cases with n = p = 2, 5, 20 and 100 and then the cases with the same p's but with the ratio njp 
fixed at 4 : 1, and finally the cases where p = 5 and 10 with njp raised to as high as 10 : 1 an d 
1000 : 1, which, in some sense, fall into the situation njp — > oo as discussed in lEl Karouil (|2006ah . 
Finally, in all these cases, we use R = 40, 000 replications. 

In terms of accuracy, from the last three columns of Table [H the approximation seems rea- 
sonable at conventional significance levels of 10%, 5% and 1% (corresponding to right-hand tails 
of the distributions) even when p is as small as 2 or 5, and keeps improving as p grows large, 
regardless of the njp ratio. When p is large, for instance, in the 100 x 100 and 400 x 100 cases, the 
Tracy- Widom law yields reasonable approximation over the whole range of interest and matches 
the finite distributions almost exactly on the right-hand tail. 
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Percentiles 


-3.8954 


-3.1804 


-2.7824 


-1.9104 


-1.2686 


-0.5923 


0.4501 


0.9793 


2.0234 


TW 


.01 


.05 


.10 


.30 


.50 


.70 


.90 


.95 


.99 


2x2 


.000 


.000 


.000 


.034 


.379 


.690 


.908 


.953 


.988 




(.000) 


(.000) 


(.000) 


(.015) 


(.345) 


(.669) 


(.902) 


(.950) 


(.987) 


5x5 


.000 


.002 


.021 


.218 


.465 


.702 


.908 


.954 


.989 




(.000) 


(.002) 


(.020) 


(.213) 


(.460) 


(.698) 


(.907) 


(.953) 


(.989) 


20 x 20 


.003 


.029 


.071 


.275 


.490 


.700 


.902 


.952 


.990 




(.003) 


(.029) 


(.071) 


(.274) 


(.489) 


(.699) 


(.901) 


(.952) 


(.990) 


100 x 100 


.008 


.044 


.091 


.291 


.495 


.699 


.901 


.951 


.990 




(.008) 


(.043) 


(.091) 


(.291) 


(.495) 


(.699) 


(.901) 


(.951) 


(.990) 


8x2 


.000 


.000 


.013 


.200 


.458 


.704 


.913 


.956 


.990 




(.000) 


(.004) 


(.031) 


(.274) 


(.534) 


(.755) 


(.931) 


(.966) 


(.992) 


20 x 5 


.001 


.018 


.057 


.262 


.486 


.703 


.905 


.952 


.990 




(.002) 


(.028) 


(.077) 


(.305) 


(.533) 


(.739) 


(.919) 


(.960) 


(.992) 


80 x 20 


.005 


.035 


.082 


.287 


.497 


.700 


.902 


.951 


.990 




(.006) 


(.043) 


(.096) 


(.312) 


(.524) 


(.723) 


(.911) 


(.956) 


(.992) 


400 x 100 


.009 


.047 


.095 


.298 


.499 


.700 


.899 


.949 


.989 




(.010) 


(.052) 


(.103) 


(.312) 


(.514) 


(.712) 


(.905) 


(.952) 


(.990) 


500 x 5 


.010 


.050 


.100 


.303 


.502 


.705 


.905 


.953 


.992 




(.022) 


(.084) 


(.154) 


(.387) 


(.589) 


(.770) 


(.932) 


(.968) 


(.995) 


5000 x 5 


.012 


.056 


.108 


.311 


.511 


.711 


.910 


.957 


.993 




(.027) 


(.098) 


(.169) 


(.406) 


(.606) 


(.783) 


(.938) 


(.971) 


(.995) 


1000 x 10 


.010 


.049 


.099 


.296 


.500 


.701 


.902 


.952 


.991 




(.017) 


(.073) 


(.136) 


(.363) 


(.567) 


(.754) 


(.925) 


(.964) 


(.993) 


10000 x 10 


.012 


.054 


.104 


.306 


.506 


.707 


.903 


.950 


.991 




(.022) 


(.084) 


(.148) 


(.381) 


(.579) 


(.764) 


(.927) 


(.964) 


(.994) 


2 x SE 


.001 


.002 


.003 


.005 


.005 


.005 


.003 


.002 


.001 



Table 1: Simulations for finite n xp vs. Tracy- Widom approximation: a ccuracy comparison of the 
new centering and scaling constants ([3]) with that in Johnstone ( 200ll ). For each combination of 
n and p, we show in the first line the estimated cumul ative probabilities for Ai, rescaled using (j4]); 
and in the second line with parentheses, rescaled using I Johnstone! (|200ll . Eq.(1.3) and (1.4)), both 
co mputed from R = 40, 000 repea ted draws using the Beta-ensemble sampling technique proposed 
by Dumitriu and Edelman ( 20021 ). The conventional significance levels are highlighted in bold font 
and the last line gives approximate standard errors based on binomial sam pling. The orthogonal 
Tracy - Widom distribution F\ was computed using the method proposed in Edelman and Persson 
(120021 ) with percentiles obtained by inverse interpolation. 



In terms of the comparison with the original centering and scaling constants, we could see 
from the first block of Table [T] that in the square cases, neither method seems superior to the 
other. However, when the ratio n/p is changed to 4 : 1 or larger (see the second and the third 
blocks of Tabled]), the improvement by using the new constants is substantial. The new constants 
not only provide better absolute accuracy in most of the cases, but also seem to result in a faster 
convergence to the limiting distribution F\. 

Last but not least, the good performance on the right tail and the faster convergence by using 
the new constants, as reflected in Table [IJ support our theoretical bound in Theorem [TJ 

Approximate percentiles. Except for computing p-values, F\ could also be used to compute 
approximate percentiles of finite nx p distributions. To measure the accuracy of this approxima- 
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Figure 1: Plots of the relative error r a = 8^ w /9 a — 1 for the approximate percentiles computed 
from F\\ (a) 95-th percentile; (b) 99-th percentile. The exact finite nxp largest eigenvalue distri- 
butions are computed using Plamen K oev's implementation in MATLAB of the recursive method 
proposed in lKoev and Edelmanl (120061) and the orthogona l Tracy- Widom law F\ is computed us- 
ing the method proposed in lEdelman and Perssonl (120021 ) . The percentiles are always obtained 
from inverse interpolation. 



tion, we consider the relative error r a = 9^ w /9 a — 1, where 9 a is the exact 100a-th percentile 
of the largest eigenvalue in finite nxp model and 9^ W is its counterpart obtained from the 
Tracy- Widom law. 

In Figure [H we plot the relative error r a for a = 0.95 and 0.99, with p ranging from 2 to 5 and 
n from 2 to 50. Although the minimum of n and p is no larger than 5, the numerical accuracy 
is reasonably satisfactory. For the 95-th percentile, the relative error ranges from 5% to 10% for 
most of the cases and slightly exceeds 10% only for the cases where p = 2 and the n/p ratios 
are high. The approximation to the 99-th percentile is even better, with the absolute relative 
error |r 99I < 5% for most of the cases. Due to the computational limitation (jKoev and Edelman . 
ioofih . we could not compute the exact percentiles when n and p are large. However, we expect 



the approximate percentiles to become more accurate as the consequence of better distributional 
approximation . 



2.2 Related statistical settings 

In this part, we review several common settings in multivariate statistics to which our result is 
relevant. 



Principal component analysis. Suppose that X = [x%, • ■ ■ ,x n ]' is a Gaussian data matrix. 
Write the sample covariance matrix S = n~ l X'HX, where H = I — is the centering 

matrix, principal component analysis looks for a sequence of standardized vectors a\, ■ ■ ■ ,a p in 
W, such that for i = 1, ■ • • ,p, where aj successively solves the following optimization problem: 

max{a'Sa : a'dj = 0, j < i} , 

where ao can be taken as the zero vector. The successive sample principal component eigenvalues 
l\> ■ ■ ■ > l p then satisfy ti = a^Sa-i. From a different perspective, these ^'s may also be found 
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as the roots of the determinantal equation 

det(5 - XI) = . 

One basic question in the application of PCA is testing the hypothesis of isotropic variation, 
i.e., the hypothesis that all the population principal component eigenvalues are equal. Under this 
null hypothesis, the population covariance matrix of the row vectors in X is a 2 1. For simplicity, 
let us suppose that a 2 = 1 (if a 2 is an unknown value, we can estimate it by some a 2 first and 
divide S by a 2 ). Then the sample covariance matrix S satisfies 

nS~W p (I,n-l). 

The largest principal component eigenvalue t\ of S is a natural test statistic for a union 
intersection test. Our result applies for ni\. 



Multidimensional scaling. Let X be an n x p data matrix. Consider the centered inner 
product matrix B = HXX'H, i.e. Bij = (ccj — x)'(xj —x). In a typical setting of multidimensional 
scaling, we are usually only given the matrix B instead of the original observations X. Let 
Ai > • • • > X p be the or dered eigenvalues of B and Vi be the corresponding eigenvector. As defined 



m 



Mardia et al.l (|1979l . Section 14.3): for fixed k (1 < k < p), the rows of = (v\, ■ ■ ■ ,Vk) are 



called the principal coordinates of X in k dimensions, which constitute the classical fc-dimensional 
solution to the multidimensional scaling problem. 

We observe that the matrix B shares its non-zero eigenvalues with nS = X'HX. For the 
principal coordinate method to make sense, it is important that non-zero eigenvalues of B and 
hence all the eigenvalues of nS do not equal a common value. Translated to the population level, 
the population covariance matrix X ^ a 2 I. Assuming a 2 = 1 (or dividing B by a 2 or its estimate 
a 2 ), the null hypothesis can be written as H$ : £ = I. As in the situation of PCA, our result is 
useful for the test statistic £±, where l\ is the largest eigenvalue of B. 



Testing that a covariance matrix equals a specified matrix. Suppose that we have the 
Gaussian data matrix X with the rows x±, ■ ■ ■ ,x n be independent N p (fj,, S) random vectors and 
consider the null hypothesis H : £ = So, where So is a specified positive definite matrix. 

If the mean vector fi is unknown, let S = n~ l X'HX be the sample covariance matrix. The 
union intersection te st uses the largest ei genvalue of the matrix Sq 1 S, denoted by Ai(S ~ 1 S'), as 
the test statistic (see Mardia et al. . 19791 . p. 130). 



We observe that Ai(E 1 5') 



1/2, 



Ai(Sq 



SEq 1 ^ 2 ), where under the null hypothesis, 



o 



-1/2 



W p (I,n-l) 



Hence, our result is available for nAi(£ 1 S'). 



For X a real n x p matrix, there exists orthogonal matrices 
U DV T , 



X 



onxp 



Singular value decomposition. 

U(n x n) and V(p xp), such that 

where D = diag(ai, • • • , cr min(niP) ) G 

called the singular value decomposition of X [See iGolub and van Loan! (|1996l . Theorem 2.5.2)]. 
For 1 < i < min(n,p), <Tj is called the i-th. singular value of X. Theorem [T] then provides an 
accurate distributional approximation for a\ when the entries of X are independent standard 
normal. 



and o\ > ■ ■ ■ > a 



> 0. This representation is 
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2.3 Other issues 



For here, we provide brief remarks on several interesting issues that we come across during the 
development of this work. More details about them could be found in Appendix [Bl 



Transformation. In the analysis of the greatest root statistic, Johnstonel ( 20071 ) suggested that 
a nonlinear transformation [the logit transformation: t(x) = log[x/(l — x)\ in his case] helps 
improve the distributional approximation by the Tracy- Widom law, see Theorem 1, Table 1 and 
Fig. 1 there. In addition to its numerical effect, the transformation has an geometric explanation 
and yields a very natural integral repre sentation for the correlation kernel which later appears 
in the central fo rmula Eq.(50) there; s ee I Johnstonel (120071 . Section 2.2, also Eq.'s (16) and (46)), 
Forrester! (12004 Propos i tion 4 .11) and Udler et all (|2000l . Proposition 4.2). 

Following Forrester ( 20041 . Proposition 4.11), if we wanted to employ a comparable transfor- 
mation for our white Wishart case, it would be the logarithmic transformation: t(x) = logx. In 
fact, in our study, we first looked into some depth along this direction and could conclude the 
following second order accuracy result: under the condition of Theorem [TJ let v np = log/2 np and 
T np = &np/fi'np, there exists a continuous and nonincreasing function C(-), such that for all real 
sq, there exists an integer Nq(sq, 7) for which we have that for any s > sq and n A p > Nq(sq,^/), 



|P{log Ai < u np + r np s} 



Fi(s)\ < C( So )(nAp)" 2 / 3 exp(- S /2). 



(5) 



Some comments on how this result could be derived are included in lB.li 
Although the rates of convergence are the same, numerical experiments suggest that using the 
nonlinear transformation does not yield as good numerical results in distributional approximation 
for small to moderate n and p as simply rescaling Ai using Q, especially on the right-hand tail 
which is of the most statistical interest. When n and p grow large, using the transformation or 
not does not have as much influence, as they approach the same limit. 

In consideration of the actual quality of approximation, especially for small to moderate n 
and p, we suggest not using the logarithmic transformation for the largest eigenvalues. However, 
it is of the oretical interest t o know why such natural transformation works for the greatest root 
statistic in I.Tohnstonel (j2007h but not for the largest eigenvalue in white Wishart matrices here. 



The smallest eigenvalue. Following the principle of union intersection tests, the smallest 



eigenv alue could also serve as the test statistic in some cases, see, for instance, iMardia et al 



(Il979l . Section 5.2.2c). Hence, what we have established for the largest eigenvalue is also worth 
investigation for the smallest one. Moreover, understanding the deviation of the smallest eigen- 
value from its almost sure limit is also of independent interest. For example, it plays an important 
role in t he theory of sp arse s ignal recovery fr o m lar ge underdetermined linear system. See, for 
example, Donoho ( 2004 ) and Candes and Tao ( 20061 ) . In fact, as we studied the accuracy result 
for the largest eigenvalue using the logarithmic transformation, we obtained a parallel result for 
smallest eigenvalues as a pleasant byproduct. We state without proof the result here. 

Suppose that n — 1 > p and n/p — > 7 G (1, 00) and introduce the reflect Tracy- Widom law 
(|PaulU2006h as 

Gi(s) = l-F 1 (-s). 



Let 



np 



n- 2 



a 



up 



n- 2 




1/3 



n 
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and then define 

1 2 

Tnp = Vnp/Vnp, and Kp = l °g Vnp + g ( r np) ■ ( 6 ) 

We then have that for the smallest eigenvalue X p of a p x p white Wishart matrix with n degrees 
of freedom, there exists a continuous and nondecreasing function C(-), such that for all real so, 
there exists an integer Nq(sq, 7) for which we have that for any s < sq and p > Nq(sq, 7), 

|P{log X P < I/" + T- p s} - Gi(s)| < C(s )p- 2/3 exp(s/2) . (7) 
See lB.2l for remarks on how to prove this result. 

Unlike the case for Ai, the logarithmic transformation improves the numerical accuracy of the 
distributional approximation for X p significantly, especially when p is small and n/p is close to 
1. We feel that an intuitive explanation to this phenomenon could be the following: for X p , the 
lower bound at strongly affects the approximation on the original scale, especially when both 
p and n/p are small. However, by transforming X p to logA p , one maps the lower bound to —00 
and hence avoids this 'hard edge' effect. The largest eigenvalue does not enjoy such a benefit for 
it does not have an algebraic upper bound. 

As a numerical illustration, in Table [2j we present some simulation results on the Tracy- 
Widom approximation to smallest eigenvalues transformed as above for two n/p ratios: 2 : 1 and 
4 : 1, both with p = 5, 10 and 100. Again, for each combination of n and p, we run R = 40, 000 
replications. The approximation seems good on the left-hand tail (where traditional significance 
levels locate) even for p as small as 5, regardless of the n/p ratio. Moreover, for both n/p ratios, 
when p grows to 100, the approximation becomes reasonably accurate over the entire range under 
investigation and is almost perfect on the left-hand tail. Therefore, the numerical results agree 
well with the theory for the smallest eigenvalues, too. 



Percentiles 


3.8954 


3.1804 


2.7824 


1.9104 


1.2686 


0.5923 


-0.4501 


-0.9793 


-2.0234 


RTW 


.99 


.95 


.90 


.70 


.50 


.30 


.10 


.05 


.01 


10 x 5 


1.000 


.995 


.976 


.796 


.553 


.306 


.093 


.045 


.012 


20 x 10 


.999 


.984 


.952 


.760 


.536 


.305 


.098 


.049 


.011 


200 x 100 


.993 


.958 


.910 


.708 


.504 


.301 


.099 


.050 


.010 


20 x 5 


.998 


.977 


.939 


.745 


.527 


.306 


.097 


.049 


.010 


40 x 10 


.996 


.969 


.926 


.726 


.511 


.300 


.098 


.048 


.010 


400 x 100 


.993 


.955 


.905 


.703 


.501 


.301 


.100 


.050 


.010 


2 x SE 


.001 


.002 


.003 


.005 


.005 


.005 


.003 


.002 


.001 



Table 2: Simulations for finite n x p vs. Tracy-Widom approximation: the smallest eigen- 
value. For each combination of n and p, the estimated cumulative probabilities are computed 
for (log X p — Vnp)/ T np with R = 40, 000 draws from W P (I, n). The methods of sampling, comput- 
ing F\ and obtaining percentiles are the same as in Table [TJ The conventional significance levels 
are highlighted in bold font and the last line gives approximate standard errors based on binomial 
sampling. 



3 Random Matrix Theory 

The establishment of Theorem [1] relies heavily on results and methods from Random Matrix 
Theory (RMT) literature. In particular, those about unitary and orthogonal Laguerre matrix 
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ensembles play an important role. In this section, we first restate our main result using RMT ter- 
minology. With a Lipschitz-type bound, we transform the problem into the study of convergence 
rate of operators with matrix kernels and derive the closed form representation (|15p of the top-left 
entry in the kernel for Laguerre orthogonal ensemble. Finally, we study the effect of scaling on our 
kernel representation and carefully formulate the analysis problem to be solved in later sections. 



3.1 Restatement of Theorem Q] in Random Matrix Theory 

Suppose A is an N x N matrix following a Wn(I, n) distribution with n > N. [Here and after, 
following the RMT notational convention, we use N rather than p to denote the number of 
features.] T he celebrated joi nt probability density function of the eigenvalues x\ > • • • > xjy > 
is given by (jMuirheadl . Il982l ) : 

N 

Vn{xx,--- ,x n ) = d~^ N Yl {x j -x k )J\_x n - N - 1 e- x ^ 2 , 

±<j<k<N j=l 

where d n) N is a normalizing constant depending only on n and N. 

On the other hand, RMT people have investigated Laguerre Orthogonal Ensembles (LOE), 
where 'ensemble' stands for distribution of matrices and 'orthogonal' refers to the invariance of the 
distribution under orthogonal transformations. The LOE(A r , a) model (a > —1) has the matrix 
eigenvalue density as 

N 

p N {x u ■■■ , XN ) = d^ N Yl ( x i ~ x k) II x f e ~ Xj/2 > ( 8 ) 

±<j<k<N 3=1 

where x\ > • • • > xn > and da,N is a normalizing constant depending only on N and a. 

If we define qat = n — N, the joint eigenvalue density of white Wishart matrix A is exactly the 
eigenvalue density of the LOE(iV, — 1) model. By this observation, we can formulate Theorem 
[1] in terms of RMT as the following. 

Theorem 2. Let x\ be the largest eigenvalue in the LOE(N, — 1) model and F\ be the orthog- 
onal Tracy- Widom law. Define /2 nj _/v o,nd & n ^N o,s 



fin,N = [ \/n - \ + JN - i , a n , N =(Jn-\ + \lN -±) \ r — + 




1/3 



1 1 



(9) 

If n > N , N — > oo,n = n(N) — > oo and n/N -» 7 £ [l,oo), there exists a continuous and 
nonincreasing function C(-), such that for all real so, there exists an integer Nq(so,^) for which 
we have that for any s > sq and N > Nq(sq,^), 

\P{xi < fin,N + v n , N s}-F 1 (s)\ < C(s )iV~ 2/3 exp(- S /2). 

Remark 2. The theorem is stated only for situations where n > N. It works equally well when 
n < N by switching n and N. This results from the following observations: (a) constants in ([9]) 
are symmetric in n and iV and (b) switching n and N does not change the distribution of x\. 
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3.2 Operator determinant and kernel representation 

We focus on the LOEfiV , a) model in ([8]) for the moment. For general orthogonal ensembles, 
Tracy and Widoml (|1998l . Section 9) showed that when iV is even, for x = Ix>x' : 



F NA (x') = P{ Xl < x'} = y/det(I - K NX ) 
with Kn an operator with a 2 x 2 matrix kernel: 

I -d 2 



ex 



Sn,x( x iV) 





e(x -y) 



(10) 



(11) 



where d 2 is the differential operator with respect to the second argument, e\ is the convolution 
operator acting on the first argument with the kernel e{x—y) = ^sgn(x— y) and TS(x, y) = S(y, x) 
for any kernel S. Howev er, no explicit representation of Snx was given there. 

In a follow-up paper, IWidoml )l999h derived explicit exp ression of th e kern el Snx f° r Gaussian 
and Laguerre orthogonal ensembles, which is summarized in lAdler et al. ( 2000l . Eq.(4.3)) in a more 
friendly form. In par ticular, for the LO EfiV, a) model of our interest, we have [Warning: we need 
to switch x and y in lAdler et all (j2000l . Eq.(4.3)).]: 



S N>1 (x,y) = S N , 2 (x,y)+ 



Nl 



4T(N + a 



. x «/2 e -x/2 



dx 



L%(x) 



n(y - z)z & / 2 - 1 e- z / 2 [L%(z) - L^^dz, 



(12) 



where L% (k = N — 1,JV) are Laguerre polynomials defined in Szegol ( 19751 . Chapter V) and 
Sn,2{x,d) is the kernel related to the Laguerre unitary ensemble (LUE) with parameter (N,a), 
which has the following eigenvalue density: 



iV 



Pn(xx, 



,x N 



c u!n n ( x j - x k?Yix^ 

±<j<k<N j=X 



x± > • • • > xn > 0. 



With (|12j) . we start to d e rive a n closed form representation for Sn,x after some necessary 
definitions. As in I Johnstone! ( 200lh . we define a basis {4>k} ( kLo on ^ 2 ([0,oo)) with transformed 
Laguerre polynomials 



<f>k(x;a) 



k\ 



(k + a)\ 



x & ' 2 e- x / 2 Ll(x). 



(13) 



Then calling = \J N(N + a), we follow |E1 Karouil (|2006bl . Section 2) to introduce for x > 0, 



[x: a 



) = ( _!)iv /^ Mx;5 _ 1)a; -i/2 ; V , (x;5) = ( _i)iv-i + (14 ) 



Wit h the definition in (|14|) . for the first term in (|12j) . I Johnstone! (|200ll . Eq.(3.6)) and lEl Karoui 
( 2006bl . Appendix A. 5) gave the following integral representation 



SN,2(x,y)= / <j)(x + z)ip(y + z) + ip(x + z)<j){y + z)dz. 
Jo 

For the second term, we could apply ISzeg | (|l975l . Eq.(5.1.13), (5.1.14)) to obtain that it equals 
Nl 



4r(A^ + d)' 



^ 2 e-^ 2 L% + _\(x) / sgn(y - z)z & / 2 - 1 e- z / 2 L%'\z)dz = if,(x) / e(y - z)<t>{z)dz. 
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Hence, we obtain 



S N x(x,y) = S Nt 2(x,y) + ip(x) / e{y - z)<j>{z)dz. 



oo 







Recall that for the white Wishart matrix A ~ Wn(I, n), setting = n — N, it is connected 
to the LOE(iV, a) model by the identity a = — 1- Thus, if we use the parameters ./V and a at, 
then the above calculation gives the following representation for SVi : 



S N> i(x, y; a N - 1) = S N>2 (x, y; a N -l) + t/)(x; a N - 1) / e(y - z)(f>{z; a N - l)dz. 



oo 







3.2.1 Framework for deriving the determinant formula 



(15) 



The determinant formula (jlOp introduced at the beginning of this subsection provides the foun- 
dation for the convergence arguments. However, it is worth clarification under which framework 
it i s derived. 

Tracv and Widoml (l2005h described with care the operator convergence of K^X to the limit 



Kqoe for the Hermite finite N ensemble. We adapt and extend their approach to the Laguerre 
finite N ensemble. Therefore, we paraphrase their remarks on the weighted Hilbert spaces and 
regularized 2-determinants under the current setting. 

In the kernel given in fjllf) . the first term on the right hand side has each of its entries finite 
ra nk operators and hence a trace class operator. However, this is not true for e(x — y). According 
to Reed and Simon Theorem VI. 23), it is even not Hilbert-Schmidt on L 2 ([x', oo)). One 



way to take care of this p roblem is to introduce th e weighted L 2 space and to generalize the 
operator determinant as in Tracy and Widoml ( 2005 ). 



To this end, let p be any weight function which satisfies the following two conditions: 

(1) its reciprocal p~ l G L 1 ([0,oo)); and 

(2) each operator that constitutes elements in the first term on the right hand side of (llip is in 

L 2 ([x',oo); /0 )nL 2 ([x',oo);^ 1 ). 



Then, as remarked in Tracv and Widom ( 20051 ) . e: L 2 ([x', oo); p) — > L 2 ([x', oo); p 1 ) is Hilbert- 



Schmidt. Moreover, Kn could now be regarded as a 2x2 matrix kernel on the space L 2 ([x' , oo); /?)© 
L 2 ([x',oo);p^). 

We have thus made clear on which space the kernel acts. In order for the determinant 
formula (|10l) to hold, we need a generalization of the usual Fredholm determinant for trace class 
operators to determinant for Hilbert-Schmidt operators. 

By our condition on p, for Kn = [-fQj]i<ij<2> we regard K\\ and K22 as trace class operators 
on L 2 ([x', 00); p) and L 2 ([x', 00); p _1 ) respectively and off-diagonal elements as Hilbert-Schmidt 
operators: 

K12 : L 2 ([x', 00); p" 1 ) — > L 2 ([x ,00); p) and K 2 \ : L 2 ([x', 00); p) —> L 2 ([x', 00); p _1 ). 

Hence, tr(ifjv) = t r (i^ii) + tr(i^22) is well defined. The regularized 2-determinant of Hilbert- 
Schmidt operator T with eigenvalues p.^ is defined by 



det 2 (/-T) = Y[(l-n k )e^ 

k 
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Then one naturally extends the operator definition of determinants to Hilbert-Schmidt operator 
matrix T with trace class diagonal entries by setting 



det(J — T) = det 2 (J - T) exp(-trT). (16) 



Finally, as remarked in iTracv and Widoml (|2005l ). the resulting notion of det( J — K^) is 



independent of the choice of p and allows the derivation in Tracy and Widom ( 19981 ) that yields 
(HDD, (HI]) and eventually (PT5D. 

Later in Section r4.1.1l we will make a specific choice of p, which not only makes our arguments 
more explicit but also eases the derivation of the right tail exponential decay in our desired bound. 

3.3 Scaling the kernel 

Fixing any real number sq and introducing the linear transformation r(s) = fl n N + s&nNi we are 
interested in the convergence rate of Fn,i(t(s')) to Fi(s') for all s' > sq. 
Define the rescaled kernel K T as the following: 



K T (s,t) = y 7 r'(s)r'(t) K N (r(s),T(t)) = a ntN K N (p, ntN + sa HtN , p n ^ N + ta ntN ). 

We have det(J — Kjy) = det(J — K T ) by noticing that Kn and K T share the spectrum. We give 
below an explicit representation of K T for later use. 

Before we proceed, we apply the r-scaling to <p, ip and 5v,2 and thus define 

(f) T (s) = a n>N <p(fl n!N + sa nsN ), ip T (s) = v n ,Ni>([j>n,N + sa ntN ) (17) 

and 

S T (s, t) = <J n ,NSN,2{Pn,N + Sa n> N, Pn,N + ^n,Af) 

(18) 

4> T (s + z)ip T (t + z) + Vv(s + z)(j) T (t + z)dz. 

For later convenience, (f> T {s) and ip T (s) are assumed to be when r(s) = p n ,N + s&n,N < 0, and 
hence they are well-defined on the entire real line. 
Finally, we introduce the short notation 

/oo 
e(t - z)4> T (z)dz = S T (s,t) + Vv(a) (e^r) W- (19) 
-oo 

[We remind the reader that in the above discussion, we have dropped the explicit dependence on 
a or ajv — 1 to avoid notation nightmare. Henceforth, we mention the explicit dependence only 
for eliminating ambiguity] 

We further observe that the determinant formula does not change if we modify K T as 



K T (s,t) 



K T ,ll(s,t) ^n,NK Ts i2(s,t) 



for the spectrum does not change. Based on this observation and our detailed calculation in IA.21 
we could represent the entries of K T as 

K Till (8,t) = S?(s,t), K T . l2 (s,t) = -d t S?(s,t), 
K Ti2 i(s,t) = (e 1 S?)(s,t)-e(s-t), K T>22 (s,t) = K T , n (t,s). 
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For the desired limit -Fi(s') of the sequence Fzvi(s'), Tracy and Widom ( 20051 ) showed that 



5(a, t) SD(s,t) 
IS{s,t) - e(s -t) S{t,s) 



Fi(s') = \J det(7 — Kqqe), where the operator Kqoe has the matrix kernel 

KcoE(s,t) ■■ 
with the entries given by 

S(s, t) = S A (s, t) + ^Ai(s) (l - J™ M(u)dv^j , SD(s, t) = -d t S A (s, t) - ±Ai(a)Ai(i) 

/oo \ / I s f°° 1'°° \ 

S A (u,t)du + -I J ki(u)du + J k\(u)du J Ai (u) du J . 

Here Sa (s , t) = f °° Ai (s + u)Ai(t + u)du represents the Airy kernel with Ai(-) the Airy function 
defined in lOlverl (]1974l . p.53, Eq.(8.01)). 



By our discussion in Section 13.2.14 it is necessary that both K T and Kqoe belong to the 
following class A of operators 

A = {2 x 2 Hilbert-Schmidt operator matrices A on 

L 2 ([s' , oo); p o t) © £ 2 ([V, oo); p~ l o r) with trace class diagonal entries}. 

This fact will be verified after we choose a specific p in Section 14.1. 11 For the convenience of 
argument, let us assume it for the moment. 

3.4 Lipschitz bound and kernel difference 

Let p N = F NA (s') and = F 1 (s'), we note that \p N - Pool < \p 2 N - pIo\/Poo < C(s )\p 2 N - p 2 



OO I ' 



where C(sq) = 1/Fi(sq) which is continuous and non-increasing in sq. Thus, we are led to the 
difference of the determinants 

\F N ,i( s> ) - f i(s')\ < C(s )\det(I - K T ) - det(I - K GO e)\ ■ (22) 

Remark 3. Here and after, we use C(sq) to denote in general any continuous and non-increasing 
function of sq and C any universal constant, where the actual function and constant might be 
different from display to display. 

To study the quantity on the right hand side of (|22p , our basic tool is the following Lipschitz- 
type bound on the matrix operator determinant for operators in A. 

Proposition 1. For operators A and B in class A and determinants of I — A and I — B defined 
as in ([16]), ifYji 1 1 At ~ B ii\\l + HA? - -Bijlb < 1/2, then 

|det(/ -A)- det(/ - B)\ < M(B) j ^ \\Au - B^ + ^ - B y || 2 I , (23) 

where M(B) = 2 |det(7 — B)\ +2exp [2(1 + \\B\\ 2 ) 2 + \\Bu\\i 

Note that the leading term on the right hand side of (|23l) depe nds only on 6. In this sense 



Proposition Q] is a refinement of Proposition 3 in Johnstone] ( 2007 ). Its proof could be found in 
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By Propositiondl if we could control the entry- wise convergence rate of K T to Kgoe, we will be 
able to bound the right hand side of (|22|) and hence prove our theorem. To this end, a convenient 
expression of the kernel difference K T — Kqqe is helpf ul, we derive such an expression below by 
essentially adapt ing the arguments in Johnstone! ( 2007 . Section 8.3) to the current context. 

According to Nagao and Forrester ( 19951 . Ea~(4.2)). we could calculate [see IA. II for detail] that 
when N is even, 



/oo 
ip T (s; un — l)ds = 0, and 
-oo 

N v\ n - 1)1/4 r (iv±3) 



4> T (s', oin — l)ds 



Tin) 



T{N + l) 



1/2 



For later use, we define (3n = \ (j) T (s)ds. 



Bring in the right-tail integration operator e introduced in I Johnstond (120071 . Section 8.3) as 



(24) 



[eg){s) = / g(u)du, 

J s 

we have the identity (eg)(s) = \ g(u)du — (ig)(s) and hence obtain 

ecp T — f3j\[ — i(j) T , and eip T = —eip T . 
For S T defined in (|18p . by Fubini's theorem [justified by Lemma [1] 

/•oo 

S T (u,t)du = 2(5 N I ip T {t + z)dz = 2(3 N {eip T ){t). 



Observing that for any kernel A(s,t), (eiA)(s,t) = \ A(u,t)du — A(u,t)du, and in- 
troducing the abbreviation a <S> b for rank one operator with kernel a(s)b(t), we have e\S T = 
Pn <8> £ipr — £iS T , and for in (fl"9j) . we have = S T + ip T (8) @n — Vv <8> s<p T , which finally gives 

eiSy = -ei (SV - Vv <8> £<M + /3jv (1 ® eVv - e^V ® 1) • 
By the explicit expressions for iT r entries in (|20[) . 



Kr(a,t) 



S*(s,t) -dtS?(s,t) 
e 1 S?(8,t) S?(t,s) 



+ 











-e(a - t) 



K £ 



where 



L 



and AT e 





-e 



I -d 2 
ex T 

We then decompose K T and Kqoe as follows: 

K T = K^ + K^ + K^ 2 + K e and if GOE = A* + iff + iff + K 

I -d 2 



where by defining G = Ai/\/2 and the matrix kernels L 




L 



e 2 I 



h T 

, we could write down the unspecified components in (f25j) explicitly as 



/ 

■El 



(25) 
and 



K 



R 



L[S T —TpT® £0 T ], 



and = L[S A - G ® eG], K( = -^=Li[G(s)], iff = -^=L 2 [G(t)]. 



/3jv^i[^t(s)], 
1 



A T,2 



1 



(26) 



V2 



V2 
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For Atv to be defined in ([SI]), we will establish in Lemma □ that 4> T = G + A N G' + (iV^ 2 / 3 ), 
so set Gn = G + AtvG", we write the difference 



K R - K R = L[S T - W ® e^r - S A + G 8> e{G N - A N G')} 

= L[S T -S A + A N G (g> G] - L[tp T ®e<f) T -G® eG N ] = 5 R + 5%. 

Set 

/•oo 

5 , A w (s,t)= / G(s + z)G N (t + z) + G N {s + z)G{t + z)dz, 
Jo 

since J °° Ai(s + z)Ai' (t + z) + Ai' (s + z)Ai(t + z)dz = J °° ^ [Ai(s + z)Ai(t + z)] dz = -Ai(s)Ai(t), 
we obtain 

5 R = L[S T - Sa n }- (28) 
Finally, we organize the components of K T — Kqoe as 

K T - K GOE = 5 r + <5 f + 5f + <5 2 F (29) 

where except for 5q and 5 R given in (|27p and (|28p . we further define #f = — Kf for i = 1,2. 

By the bounds (f22j) and (p3|) . we need entrywise bounds on K T — Kgoe to get our final 
convergence rate. By the decomposition in (|29p . the problem reduces to entrywise bounds for 
each of the 5-terms. Since all these entries have explicit representations, this becomes an analysis 
problem which is to be solved in the next two sections. 



4 Proof 

In this section, we prove Theorem [2] [and hence Theorem Q] by focusing on the entries of the 
5-terms in (129p . Besides the RMT analysis performed in Section [3j the proof needs two additional 
toolkits: a) asymptotics of transformed Laguerre polynomials, and b) several operator theoretic 
bounds of Hilbert-Schmidt and trace class norms. 



4.1 Preliminaries 

Here, we introduce some basic results for later repeated use in the proof. Moreover, we make a 
specific choice of the weight function p. 

We start with Laguerre polynomial asymptotics. Recall that with constants j2 n ,N, &n,N m ([9]) 
and functions </>, defined in (|14p , we have defined transformed Laguerre polynomials <p T and ip T 
in (|17|) . Moreover, for the Airy function, we define 

G(s) = ^=Ai(s). (30) 

By (|25p . (|26p and (|29p . the kernels K T and Kgoe an d hence their difference are essentially 
expressed in terms of <j> T ,ip T ,G and their variants. Therefore, we will find the following set of 
asymptotic bounds helpful to the analysis of their behavior. 

Lemma 1. Let cp T , ip T and G be defined as in (|17|) and (|30|) and Ajy to be defined in (|6ip . 
If n > N , N — > oo, n = n(N) — ► oo and n/N — > 7 G [l,oo), there exists a continuous and 
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nonincreasing function C(-), such that for any real number sq, there exists an integer Nq(so,j) 
for which we have that for all s > sq and N > Nq(sq,'j), 

l^rWI.KWl <C( S0 )ex P (- S ); (31) 

|^(s)|,|#(a)| <C(*o)exp(-s); (32) 

^ T ( S )-G( S )|,K( S )-G"( S )| <C(s )iV- 2 / 3 exp(- S ); (33) 

| WW - G(s) - A N G'(s)\ , - G'(s) - A N G"(s)\ < C(s )N- 2 / 3 exp(-s). (34) 

In order not to distract us from the cause of proving Theorem [2j we defer the proof of Lemma 
[T] to Section [5j For the rest of Section |4j let us assume temporarily that Lemma Q] is already 
established. 

In addition to the Laguerre polynomial asymptotics, we need some operator theoretic bounds 
of Hilbert-Schmi dt and trace class norms. This set of tools has been previously established in 



Johnstond (|2007l . Section 8.4.1). For the sake of completeness, we state them here with some 



corrections and modifications that are helpful to our context. 

From now on, we fix a real number sq and consider any s' £ [so, oo). In general, let an operator 
T : L 2 ([s', oo), pi) -> L 2 ([s', oo),p 2 ) defined by 

/oo 
T(u,v)f(v)dv (35) 

for some kernel T(u,v). We obtain that the Hilbert-Schmidt norm ||T||2 of T satisfies 

ll T ll2 = // \T(u,v)\ 2 pi 1 (v)p 2 (u)dudv. 

J J[s',oo) 2 

Following the notation in I.Tohnstonel (j2007h . we introduce the symbol o for the following 
convolution type operator: 

/•oo 

(aob)(u,v)= / a(u + z)b(v + z)dz. 



Among all the operators defined by (|35h . we are interested in those with kernels D of the form 
D(u,v) = a(u)(3(v), or D(u,v) = a{u)j3{v) (a o b) (u, v). We use the following notation for a 
Laplace-type transform: 

/•CO 

C(p)[t] = / e- tz p{z)dz. 



For an operator with kernel of the form D(u,v) = a(u)(3(v)(aob)(u,v), we have the following 
bound on its operator norm: 

Lemma 2. Let D be an operator taking L 2 ([s', oo), pi) to L 2 ([s', oo), p%) and having kernel 
D(u,v) = a(u)(3(v )(a o b)(u, v), where we assume, for u > s' , that 

\a{u)\ < a e aiU , \f3(u)\ < (3 e^ u , \a(u)\ < a e~ aiU , \b(u)\ < 6 e" 6ltl . (36) 

// both £(p^ 1 ) and C{p 2 ) converge for t > c, and 2(a\ — ai),2(6i — 0i) > c, the Hilbert-Schmidt 
norm satisfies 

a -^{c( P2 )[2( ai - ai m P ^)[2( bl - a)]} 172 . 

ai + fti I ) 

If p\ = p 2 , then the trace norm ||-D||i satisfies the same bound. 
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\D\\o < 



Next, we investigate ran k one operators wit h kern els of the form D(u,v) = a(u)(3(v). First, a 
remark taken verbatim from Tracy and WidomI ( 2005 ): the norm of an operator D = a®[3 taking 



L 2 (pi) to L 2 (p2) with kernel D(u,v) = a(u)(3(v) is given by \\D\\ = 1 1 ce 1 1 2,p 2 WPW2 P ~ x • Here, the 
norm can be trace class (if p\ = P2) or Hilbert-Schmidt since they agree for rank one operators. 
Moreover, if a and (3 satisfies the bound (j36H . similar derivation to that for proving Lemma [2] will 
give the following lemma specific for rank one operators. 

Lemma 3. Let D = a <£> (3 be a rank one operator taking L 2 (\s', 00), pi) to L 2 ([s', 00), p^) and 
having kernel D(u,v) = a(u)/3(v), where we assume, for u > s' , that \a(u)\ < etoe aiU and 
\fi( u )\ — /?o e ^ lU - If both C(p^ 1 ) and £>{p2) converge for t > c that —1ot.\,—1{5\ > c, the Hilbert- 
Schmidt norm satisfies 



\\D\\ 2 < ao/3o{/:(p2)[-2ai]£(pr 1 )[-2/3i]} 
If pi = P2, then the trace norm \\D\\i satisfies the same bound. 



1/2 



4.1.1 Choice of the weight function p 

In order to make our arguments explicit and to obtain the exponential decay of the right tail in 
our bound, we feel it convenient to make a specific choice of the weight function p. 
In particular, for v £ (0, 1] and to be specified later in (|45p . on the s-scale, let 

p o t(s) = 1 + exp . (37) 

The above definition implies that on the x-scale, we specify the weight function as 

'v\x ~ An,iv| 



p(x) = 1 + exp 



o~ n ,N 



We remark that on the x-scale, our choice of p depends on N. 

First of all, we check that our choice of p [on the x-scale] satisfies the two required conditions 
spelled out in Section [3.2.11 Condition (1) holds for p _1 (x) x exp(— vxja n ^) as x — > 00. Condi- 
tion (2) holds if 4>, ifi, (ft, tp', E(j) and eip belong to L 2 ([x', 00); p) n L 2 {[x' , 00); p~ x ). We take (p an d 
tjj as examples, while the argument for the rest is essentially the same. By the definition of (j)^ in 
([TBI , the right tails of both <fi and ip are bounded by exp(— x/3). On the other hand, as x — ► 00, 
p ±1 (x) x exp(±i/x/a- ni 7v) with vjo n ^ < l/cr n> jv = O (./V -2 / 3 ). These two facts suffice to show 
that both l^l 2 ^ 1 and | , (/'| 2 /9 = ' :1 are integrable over the region [x',oo), at least when N is large. 
Condition (2) is hence satisfied. 

By (|37p . the operator class A in (|2ip is now concrete. We now make valid all the for- 
ma l derivation in Section [3] by verifying that K t ,Kgoe G A. Observing that r is linear, 



by iReed and Simonl ()1980l . Theorem VI. 22(h) and Theorem VI. 23), condition (2) on p implies 
that K T — K e G A. The super exponential decay (152|) of the Airy functions, together with the 
same theorems as above, guarantees that Kqoe — K e G A. Hence, we need only to verify that 
e : L 2 ([s' , 00); por) ^ L 2 {[s' , 00); p~ l o r) is Hilbert-Schmidt, which is an immediate consequence 
of condition (1) on p. 

From now on, we use p to denote por in (I37j) with no ambiguity, for all the remaining discussion 
in this paper focuses on the s-scale. 

For the operator-theoretic bounds, by our choice of p in ([37|) . we could adapt Lemma [2] and 
Lemma [3] into a more convenient form as follows. 
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Corollary 1. With p as specified in (|37p . for v < ij/2, we have 



£>{p ) [v] — ex P {~ 7 1 S ' i — — ex P ( — ^-s' =t ^l 3 '!) • 



(38) 



In particular, under the assumption of Lemma\j^ if{pi,P2} C {/?, p x } cmdai — ai, bi— /?i > ^, 



then 



(39) 



a\ + 61 

where C = C(a\ , a\ , b± , 0i) . 

Under the assumptions of Lemma\^ if {pi,P2} C {p, p" 1 } and — a±,— 0\ > ^, i/ien 

||Z>|| 2 ,||D||i <Ca o o exp[(a 1 + 1 )s / + v\s'\] , (40) 

where C = C(a\,0i). 

The proof of (|38p follows directly from the derivation in I Johnstone! (|2007l . p. 50); see, in par- 
ticular, Eq.(205), (206) there. Then the operator bounds (j39j) and (HUj) are obtained by plugging 
(1381) into the bounds in Lemma [2] and Lemma [3j 



4.2 Operator convergence 

With the tools from the previous subsection, we work out here entrywise bounds for each 5 term 
given in the decomposition (12911 . 

6 R term. Using the o operator, we have 5 R = L[S T — Sa n ] with S T = 4> T o ip T + Vv o 4>t an d 
Sa n = Gn 0G + G0 Gn- We shall use the abbreviation D^f, k = —1,0 and 1 to denote sf, 
f and /' respectively. Regardless of the signs, we have the following unified expression for the 
entries of 5 R : 

6% = D( k \cj) T - G N ) o D®i/> T + D^G N I>W(^ T - G) 

(41) 

+ D^(tp T - G) o D®<t>T + D^GoD^(<j) T - G N ), 

for i,j G {1,2}, fc G { — 1,0} and / G {0, 1}. By Lemma[T]and asymptotics of the Airy function 
[see (|52j) ]. we find that for any of the four terms in (|4ip . the condition (|36[) is satisfied with 
a o = 00 = 1) a i = 01 = 0) °i = bi = 1 and ao, 60 as shown in the following table. 





a 


bo 


D^(4> T -G N )oD^i; T 


C(s )N~^ 


C(so) 


D^Gn D^(ip T - G) 


C(s ) 


C(s )N^ 


D (k \i) T -G)oD^4> T 


C(s )N^ 


C(so) 


D^GoD^((P T -G N ) 


C(s ) 


C(s )N^ 



We apply Corollary Q] and obtain that for v < 1, 

\\S§\\ < C(s )N- 2 / 3 exp (-2s' + u\s'\) . (42) 

Here and after, the unspecified norm || • || denotes Hilbert- Schmidt norm || ■ ] [ 2 if i 7^ j and trace 
class norm || • ||i otherwise. We remark that by a simple triangular inequality, we could choose 
the C(so) function in the last display as the sum of products of continuous and non-increasing 
functions, which could be seen from the term (aoA) a o&o)/( a i + b\) in (|39p . Moreover, the term 
C in (|39p is a universal constant for fixed a\,ot\,b\ and 0\ here. Hence, the final C(sq) function 
remains continuous and non-increasing. For the other 5 terms, we will have the same result by 
the same arguments and hence will be omitted. 
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Sq term. We reorganize 5q as 



-L[ip T <g> £4> T - G <g> iG N ] = -L[ip T ® e{(p T - G N ) + (ip T - G) <g) eG N ] 



cF.l . X F,2 



The entries of 5, 



F,i 
' 



1, 2 are all of the form a(s)/3(t) with the multipliers chosen from D^ijj T , 
D^ k \(j) T - G N ), £> (fc) (Vv - G) and D^G N for k E {-1,0, 1}. For these multipliers, the condition 
for Lemma [3] holds with the constants a± = /3± = — 1 and ao (or (3$) specified below. 





a (or fa) 




C(s ) 


DW(<t> T -G N ) 




DW(i-G) 


c(s )N-*r* 


dwg n 


C(s ) 



We apply Corollary [T] for these rank one terms and obtain that for v < 1, 
K^W < l|<£l| + llO <C(s )N- 2 / s e W (-2s' + u\s'\). 

6f and 5% terms. For these two terms, we have 



(43) 



<8> Pn - G® -±= and 5% = L 2 Pn ® 4>t ~ -jn ® G 



V2 



By their similarity, we take Sf as example and the same analysis applies to 5% with obvious 
modification. For 5f , we reorganize it as 



8f = L x ^ T -G)®p N + G®(p N -^) = 6(> 1 + S( 



rFA , rF,2 



For analysis of the terms here, Co rollary Q] no longe r works and we give an alternative bound 
which was derived in full detail in Ijohnstond (2QQ3). In particular, consider matrices of rank 
one operators on L 2 ([s' ,oo); p) (g) L 2 ([s', oo); /9 _1 ), we denote, here and after, the L 2 -norm on 
L 2 ([s' ,oo); p) and L 2 ([s', oo); p" 1 ) by || • ||+ and || • ||_ respectively. Ijohnstonel (|2007l . Eq.(214)) 
gives the following bound 



|an (S> fen || i ||ai2 <8> 612II2 

1^21 <8> 62I lb 11^22 ®&22||l 



< 



l a ll|| + ll fo ll| 
l a 2l||-||&2l| 



|oi2|| + ||&12| 
«22 - &22 



By the inequality above and our reorganization of 5f, we will see that the essential elements 
we need to bound are \\D^ k \tp T - G)||±, ||£> (fc) G|| ± and ||1||_ for k = —1 and 0. 
For \\D^(ip T — G)||±, we obtain from Lemma[T]and (|38p that for v < 1: 

pW^.G)!!! <C 2 ( So )Ar- 4 /3 £(/0 ±i) [2 ] < c 2 ( So )A r " 4/3 exp(-2 S / + z/| S / |). 
For ||Z)WG||-)-, asymptotics of the Airy function and (|38|) give that for v < 1: 
pW G ||2 < C 2 (so)£(p ±i )[2] < c 2 ( S0 )exp(-2 S , + z,| S '|). 

Finally, for ||1||-, we derive directly that 

[1 + exp(z/|s|)] -1 ds < / exp(—v\s\)ds 

J Js' 

POO r0 111 2 

< / exp(— us)ds + / exp(^s)ds = — I exp(— ^|s'|) < — . 

Jo J-\s'\ v 



v v 
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By definition of the operator L\ and our reorganization, we have the first column of 5f as 
following while the second column of it are zeros: 

SftA = ( (Vr - G) ® (3 N + G ® (p N - l/y/2) \ 
<W V-£(^r -G)®0 N - eG ® (fi N - l/s/2)) ■ 

Assuming [3n — l/\/2 = O (-^V -1 ) [for a proof, see lA.lj . we have 
||<lll|l <||(VV " G) <g> /JjvUi + ||G (/fo - 1/V2)||l 

<||ty T - G)\\ + \\p N \\- + l|G||+||ftv - l/yfN\\~ 

<C(s )iV~ 2/3 f~ 1/2 exp (-a' + v\s'\/2) + C^N^y' 1 ' 2 exp (s' + u\s'\/2) 

<C(s )A r " 2/3 ^" 1/2 exp(-s / + z/|s / |/2) < C(s )N~ 2 / 3 exp(-s'/2). 

The last inequality holds by fixing v, for example, at 1. By the same calculation, this bound also 
holds for ||<5f 12 ||2 an d those entries of <5f\ Finally, we conclude our analysis with the following 
bound on entries of 5f and <5^: for v = 1, we have 

fell, fell < C(s )iV- 2/3 exp (- S '/2) . (44) 

4.3 Proof of Theorem H 

Throughout the proof, we fix 

i/ = 1 (45) 

in the weight function p specified in ([37|) . 

By d2SD and the bounds ([12]) . (03]) and (03]), we bound the entries of iT T — Kqqe using a 
simple triangular inequality 

||#T,ij " K GO E,ii\\ < C(s )N- 2 / 3 exp(-s'/2). 

Apply Proposition [1] with A = K T and B = Kqo e , 

|det(/ - K T ) - det(J - if G oB)| < M(K GO £)C( So )iV" 2/3 exp(- S '/2), (46) 

where 



M{K GOE ) = 2 det(/ - ET GOj e) + 2 exp 



2 (1 + Hi^GOElb) 2 + ^2 W K GOE,iih 



For the first term in M(Kqoe), we have det(I — Kqoe) = Fi( s ') < 1- O n the other hand, 
we have 

\\KaOEh < W K GOE,ijh < W K GOE,iih + \\ K GOE,ijh- 

In principle, one could show for each % and j 

\\K GO E,ij\\ < C(so), 

with C(so) continuous and non-increasing. Here, we only take \\Kgoe n||i as an example for the 
proof of the others is essentially the same. Let H r and G T be Hilbert-Schmidt operators with 
kernels (p T (x + y) and ip T (x + y) respectively, then as operator 

Kgoe,xx = H T G T + G T H T + G®-^=-G®eG. 

v2 
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By the relation ||AB||i < ||A|| 2 ||-B||2, 

||-^goe,h||i < 2 \\H T H2IIGVH2 + — 7= 1 1^-^11 1 1 1 II 2 s/ o — 1 + H^lkplleGl^p-i. 

Each norm on the right hand side of the above inequality is the square root of an integral of a 
positive function on [s',00) or [s 1 , 00) 2 that is bounded by the corresponding integral over [so, 00) 
or [so, oo) 2 , which in turn is continuous and non-increasing in sq. Hence, ||i^GO£,n||i < C(so). 

By the above discussion, we could control the second term of M(Kgoe) and hence itself by a 
continuous and non-increasing C(sq). Finally, we complete the proof by combining this fact with 
the initial bounds (|46l) and p2l). 



5 Laguerre Polynomial Asymptotics 



In this section, our goal is to establish Lemma [TJ To this end, we exploit the Liouville-Green 
approach to study the related asymptotics for Laguerre polynomial s of b o th large order and 
large degree. This approach has bee n successfully used in Ijohnstonel (|200lh . IeI Karouil (|2006bh 
and more recently, Johnstone ( 20071 ) in deriving similar type of results. The novelty here is the 

establishment of the bounds ([33]) and ([M]) for the derivatives of these polyno mials. 

To start with, let us consider the "intermediate" function F n> N introduced in lEl Karoui (l2006bl . 
Section 2.2.2) as 

F n , N (x) = (-l) N a-% 2 V^ ] -x {aN+1)/2 z- X,2 L a N N (x) (47) 



with ocn 



n 



N. We could then relate -F n ,iv to 4>n, 4> and cj) T as 
4> N (x;a N ) = (-1) N a l J 2 N x~ 1/2 F ritN (x), 



{x;a N - 1) 



Afl/4( n _ 2)1/4 



V2 



1 /2 

<r n -2N F n-2,N(x)/x, 



and 




Hn-2,N 



F n -2,N(fin,N + Sa U} N) 



fJ-n-2,N 



with Hn,N and a n: N defined as 



and a n n 




using the abbreviations n + = n + ^ and N+ = N + ^. If we replace the subscripts (n — 2,N) 
in /J, n -2,N,c r n-2,N and F n _2,N by (n — 1, N — 1) on the right hand sides of the expressions for 
4>(x;ctN — 1) and 4> T (s), we obtain the identities for ip{x;aN — 1) and ip T {s). Due to this close 
connection of (fi T and ip T to F n ^, the essential element for proving the desired asymptotic bounds 
reduces to the understanding of the behavior of F n x and its derivative, for which the Liouville- 
Green approach is instrumental. 

In the rest of this section, we first study in detail the Liouville-Green approximation to the 
F ni N function and its derivative. Then the result is used to facilitate the derivation of the global 
bounds and the local as well as global Airy approximation to (f> T , ip T and their derivatives. 
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5.1 Liouville-Green approach 



Ma ny of the a r gumen ts in this part have been spelled out in some detail in iJohnstond (120011) 
and lEl Karoul (|2006bh . A more complete account of the theory could be found in lOlverl (|1974| . 
C hapter 11). Howev er, for completeness, we state them here briefly with notation similar to that 



m 



El Karouil ()2006bh . 



Consider wn(x) = x ( aN+1 ^ 2 e X ' 2 L N N (x) as a multiple of F n ^, we have 



d 2 WN 

dx 2 



1 _ kn - 1/4 

A * O 



x 1 - 



W N 



(48) 



with k n = N + SiLtl = ^±f±i and \ N = Sf = S^S. 
By a change of variable £ = x/k^ , we obtain 

d 2 WN 



where 



no 



d£ 2 



4£ 



{K 2 f(0+9(0}™N, 

and #(£) = 



1 

4^2 



with £± = 2 ± y4 — and u;^ = 2\n / 
the change of independent variable as 



C 3/2 = ^v7W^ (£>£+) and \{-Cf 12 = V^Mdt (£<£+), 



^tWtt. The Liouville-Green method introduces 

n+iV+l 



and defines a new dependent variable = (dQ/d£) l / 2 WN- For the new pair (W, Q, we have the 
new differential equation as 

= {^C + «(wiv,C)}w: 

Let / = f/C, the recessive solution of (08]) satisfies ( Olverl . E974 . p. 399, Theorem 3.1) 

w n (k n oc /-^(eKAi^C) + £ 2 («jv,0}. 
with the following estimates for the error term £2 and its derivative with £ £ [2, 00): 

Ao 



<m(4 /3 c)e- i (4 /3 o 
\dse 2 (K N ,o\ < 4 /3 / 1/2 (e)N(4 /3 c)E- i (4 /3 o 



exp — F(oj n ) - 1 
Ao 



exp — F(wat) - 1 



In the above bounds, M, E are the modulus and weight functions for the Airy function, and N 
the phase function for it s derivativ e ( Olverl . 1974 . pp. 394-396). Moreover, Ao = 1.04 and F(u>n) 
has been well studied in lEl Karouil (j2006bl . A. 3). 



For the function F n w of our interest, we have from lEl Karouil (|2006bl . Eq.(5) and A.l) that 



F n ,N(x) = r N 



K N 



1/6 



a 



r l / 4 (e){Ai(4 /3 o+e 2 (^,o}, 



n,N 
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with 



r 2 



27rexp[-(n + + A+)]n"+ A+ 4 



l + 0(n ,A^) . 



For the convenience of argument, we define an auxiliary function Rn(0 = (C(£)/C/v) 
6v = C(£+)- We remark that by our definition, we have a n ^ = (k^' 3 Ov) -1 an d / 
Hence, F n jv could be rewritten as 



F n . 



N 



r N R N {i){A\{4 Z C)+e 2 (K N ,i)}. 



Finally, we conclude th is part with some useful bounds and asymptotics of M, E,N 
Airy function dOlverl . Il974l pp.392-397). As x — > oo, we have 



(49) 
x / 2 with 

= C(0 2 - 

(50) 
and the 



E(s)~ V2e§ x3/2 , M(x) ~ ir-^x- 1 / 4 , N(x) ~ tt'^x 1 / 4 
For all x > 0, the Airy function and its derivative are bounded as 

7 



< Ai(x) < 



_2 T 3/2 

e z 



|Ai'(x)| < 1 + 



48x 3/2 



x l/4 e 3 a; 3/2 



2vr 1 /2 



2^/2^1/4' 

Finally, for all x, we have the following bounds 

|Ai(x)| < M(.x)E- 1 (x), |Ai'(s)| < N(x)E" 1 (x), M(x) < 1, E(x) > 1, 
and finally, E(x) is monotone increasing in x ( Olverl . 1974 . p. 395). 



(51) 



(52) 



(53) 



5.2 Large N asymptotics 

We now derive the large ./V as ymptotics o f t/y, ?/y and related functions. First, we use the analysis 
done in Johnstone ( 200ll ) and El Karoui ( 2006bl ) to obtain bounds for \tp T \ and \ip T — G\ without 



much extra effort. Then we derive the bounds for \ip' T \ and \tp' T — G\, which need some careful 
analysis to be detailed below and the bound on \ip T — G\ is then further refined to match the 
claim in Lemma [TJ Finally, corresponding results for quantities related to cp T could be obtained 
by understanding the difference of the centering and scaling constants involved in <j) T and ip T . 

5.2.1 Bounds for |Vv(s)| and \i/j t (s) - G(s)\ 
We define x n> N(s) = ^ n ,N + s&n,N an d let 



1n,N{Xn,N{s)) = F n , N (x n ^ N (s)) 



Xn,N(s)J 



Johnstond (120011 . A. 8) showed that under the condition of Lemma [U 

F^x^Wv^N- 1 ^ < Cexp(-a), for all a > 0. 

Simple manipulation gives c^^iV 1 / 6 < <r n ^ 2 A^' 6 < {N + /n + ) 1 ^ 2 < 1, and hence for all s > 0, 

\F n ,N(x n ,N(s))\ < Cexp(-s). 
If s < 0, by flMD, ||5I1), C2D and IeI Karoul (|2006bl . A.3), we obtain that when N > N (s ,j), 
\F n , N (xnAs))\ < r N \Rn(0\ M(4 /3 C)E" 1 (4 /3 C) < 2 E-\k% 3 < 2. 
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If we let M(sq) = max s6 r SQ:0 ]{2e s }, and define 

C(s ) = max{C,M( So )I So <o,M(0)}, 

it is then continuous and non-increasing in sq as desired and we have that when N > Nq(so, / j), 
\F n ,N{x n ,N(s))\ < C(s ) exp(-s) for all s > s . Moreover, by noting a niN /fj, n>N = 0(N~ 2 / 3 ), 
when N is larger than some constant that depends only on sq, 



X n ,N( s ) 



< 1 + s 



Cn,N 
' (J«n,N 



< 2, for all s > sq- 



Hence, under the condition of Lemma[H we have that when N > iVo(so,7), 

\6 n ,N(xn,N(s))\ < C(s ) exp(-s), for all s > s . 

Later on, El Karoui (|2006bl . Section 3.2) showed that for any constant qn = 1 + O (iV x ), if 
we define A Uj iy(x n ^(s)) = \gN^n,N(x nj iy(s)) — Ai(s) | , then under the condition of Lemma [H we 
have 

iV 2/3 A n jv(x n jv(s)) < C(s ) exp(-s/2), for all s > s . 



For tp T (s), we have p, n> 



N 



fJ"n-l,N- 



-i and a. 



n,N 



a n - \ n-1 and hence it is of the form 



-^PN^n-i,N~i{x n ~i,N-i{s))- Noting that = 1 + O [N x ) [see I A. II for a proof], we apply 
the bounds for 9 n ^N and A nj Ar directly and obtain that under the condition of Lemma CD, when 

N>N {s ,-f), 



|^ T («)| < C(s )exp(-s), and |Vv(s) - G(s)\ < C(s )N~ 2/3 exp(-s/2), for all s > 



so- 



Actually the bound on |Vv(-s) — G(s)\ could be further improved to be that claimed in Lemma 
[TJ see (|59p for the refinement. We also remark that we could not apply the results directly to <p T 
since the centering and scaling constants {pn-2,N-,&n-2,N) specific to F n _2,N does not agree with 
the global constants (p n ,N , &n,N) which we use. 

5.2.2 Bounds for |<(s)| and \ip' T (s) - G'(s)\ 

As we have seen, the analysis of ip T depends on our understanding of the function O nt N(x n ^(s)). 
To investigate the bounds for ip' T and its approximation by G' , we start with a detailed analysis 
of the quantity d s 6 n! N(x n ,N(s)). 

We split d s 6 nt N(xn,N(s)) into two parts: 



\d s dn,N(Xn,N(s))\ < 



&n,NFn,N( x n,N{s)) 

Tn,i(s) + T n , 2 (s). 



Xu,n(s) 



+ 



Pn,NFn,N(x n ,N(s)) 



<7V( S ) 



T/v,2 term. This term is relatively easy to bound. Note that Ta^s) = \O n ,N(x n ,N(s))a nt N / 'x n ^]y(s)\ 
and that a n ,N / Hn,N = O (N~ 2 / 3 ). When N > N {s ), the ratio 

Wn,N/x n , N (s)\ = \s + Pn,N / <?n,N\~ l ^ C(s Q )N~ 2/3 , for all S > S . 

Hence, by our previous bound on jv|, we obtain that under the condition of Lemma [TJ 

T N ,2( S ) < C{s )N~ 2/3 exp(-s), for all s > s . 
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T/v,i term. Recalling that fi n ,N/x n ,N(s) could be bounded by 2, we focus on <r rii NF' n N . Thinking 
of x = x n ^(s), we have from ([50]) that 



a n,NF' nN {x) = r N 



(?n,N 



K N 

+ r N R N (0 



i4(0 [Ai(4 /3 0+e 2 ( Kjv ,0 



Ai'(4 3 c)^ 2 (0 + 



k n J di 



:£2(k!V,0 



To facilitate our analysis, on the s-scale, we divide the whole region [sq,oo) as I\n U /2,7V with 
h,N = [so, SiN 1 / 6 ) and I 2 , N = [siiV 1 / 6 , 00). The choice of si is made explicit in IA.4I 



Case s E Ii,n- m this case, we first reorganize cr n ^F' n N (x) as a n ^F' n N {x) = Y2i=i^n 



with 




u n,N 


= r N '— R 


V UN J 


D 3 





R'^OiH^TO +e 2 (KN,$}, Dl N = rvlR^iO - l]Ai'(4 /3 C), 



2/3, 



D n,N = r N 



( ^-\ R n (£)-^- £2 (kn,0- 
k n J d£ 



(i) Consider D x n N first. Direct computation shows N 2 / 3 (a niN / 'k n ) — > 2(l + l/ A /7) 1 / 3 (l + > /7)(l + 
7) — 1 . Hence when N > Ao(7), we have the bound 



AT?/ 3 



Moreover, by the bound (|72p for R' N and recalling that 7 > 1, we know that when A" > Nq(sq, 7) 

1/3 



AT 2 / 3 



(^) K(6 | £C ( 1 + _L) _ 



< C, for all sell 



N- 



(54) 



On the other hand, by (f5"T|) and (f74"]) . we obtain 



Ai(4 /3 C) + £ 2 («jv,0| < CM(4 /3 C)E- 1 (4 /3 < CB~\4 3 (). 
When s > 0, we know from (|74p and the monotonicity of E that when A" > Nq(sq, 7), k^C, > s/2 



holds, and hence by (|5Tj) . 

Ai(4 /3 C) + e 2 (^,6| <CB~\s/2) <Cexp (--L s 3 / 2 ^ < C exp(-s). 

If s < 0, for all s G [s ,0], we obtain from {74]) that when N > A^ (s ,7), 4^ G [3s /2,1]. 
Hence, we have 

E" 1 (4 3 C)exp(s) < C(s ) = max eE -1 (a), 

se[3s /2,l] 

the right hand side of which is, by its definition, continuous and non-increasing. Therefore, we 
conclude that when A" > A r o(so,7), 

9/0 

Ai(«;^ C) + £2{Kn,Q < C(s )exp(-s), for all s G /i,jv- (55) 

Finally, putting the bounds ([54"]) and ([55]) together and recalling that |rjy| could be bounded 
by 2, we obtain that under the condition of LemmaH] when N > A"o(so,7), on I^jv, 

\Dl N \<C(s )N~ 2 / 3 eM-s). 
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(ii) For D 2 N , we first split and control \r^R N (£) — 1| as 

\r N R^(C) - 1| < rjy - l| + \r N - 1| = ^1^(01" ^iv(0 - 1| + kiv - 1|. 

By d!| and flM}, when iV > iV (so,7), w e have |rjv| < 2, l^^)^ 1 < 2 and hence 

|nv^w(£) - 1| < CN~ 2 / 3 s + CA^ 1 < CW~ 2/3 s, for all s G 7 1)JV . (56) 
On the other hand, by (USD, we obtain 

Ai'(4 /3 o| < n(4 /3 c)e^(4 /3 o- 

When 5 > 0, we have from (f74l) that C ^ 3s/2], and using (f5T|) , we obtain 
N(4/ 3 C)E" 1 (4 3 C) < C^O^exp ("^f 3/2 ) < Cs^e W ("^/f 3/2 ) < C exp(-3 S /2). 



2/3 

If sq < 0, we know that when A?" > A^(so,7j, k jv C ^ [3so/2, 1] for all s G [sq,0]. We then 



have 



N(4 3 C)E(4 3 C)exp(3s/2) < C(s ) = max e 3/2 N(s)E(s), 

s6[3s /2,l] 



_2/3 



the right hand side of which is again continuous and non-increasing in so- As before, this enables 
us to conclude that when N > iVo(soi7)> 

Ai'(4 /3 C) C(s )exp(-3s/2), for all s G Ii,n- (57) 

Assembling (|56p and ()57|) . we obtain that when N > iVo(so,7), 

|-Dn,Jv| < C(s )A r " 2/3 |s| exp(-3s/2) < C(s )A^ 2/3 exp(-s), for all s G 7i )jv . 

(mj For D^^, recalling that = 1 + O (A^" 1 ) and we obtain the following bound under the 
condition of Lemma[T]by using the previously derived bound on Ai' (k 2 ^ 3 (,)'■ 

\Dl >N \ < C(s )exp(-s). 
(iv) For N , by the definition of Rn and Cn as well as the bound for d^S2 (kn, £); we have 



raJV 



rNRN(C)d^ 2 (K N ,S,) 



<CiV- 2 / 3 f T ni7 v^ 1/3 r J v^(0C(0N(4 /3 C)E- 1 (4 /3 C) 

= c^ 2 / 3 r^ 1 (e)N(4 3 c)E- l (4 /3 c)- 

All the terms involved in the last bound have been well studied during our analysis of -D 2 N , and 
applying various results established there, we obtain that when TV" > Nq(so,^), 

\D^ N \ < C(s )A r_2/3 exp(-s), for all s G I 1>N . 

Combining the bounds for the four terms, we obtain from a simple triangular inequality that 
when N > iVo(so>7)> 

T N ,i < C(s ) exp(-s), for s G h, N - 
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We remark that, here and after, we derive a more stringent bound with the rate term iV~ 2 / 3 
whenever possible. Although it is not necessary here, those bounds with this rate term will become 
useful in the later study of \ip' T (s) — Ai'(s)|. 

Case s £ I 2) n- In this case, we define D\ N = D\ N and t) 2 nN = D 2 nN + B\ N + D^ N . 
(i) To analyze the t)\ N term, we first introduce a useful lemma: 

Lemma 4. Let r > be fixed. For x = xjy(s) = n n ,N + sa ni N and £ = x/kn, when s > r 2 , we 
have 

<r n ,N\/ 7(0 > r W£ = rHn,N/(fJ-n,N + S(J n , N ). 



For D\ n , we could bound it for large N as 



D 



nJV 



<Cr N 



Q~n,N 
K N 



R' N (0 



Rn(0 



i? Ar (OM(4 /3 c)E- i (4 /3 o 



We consider first the Rn (?)M(4' 3 '0 term. Recall that Rn(0 = ^/v^niv 7~ 1/4 (0 and that 
|M( K iV 3 0l — C^^^C -1 ^ 4 when N is large. Applying LemmaHl we obtain that when N > ^0(7), 

^(OM(4 /3 o < cc^a-fr 1 /^) = cr xl \z)°- x j? 



< Cr" 1 ' 2 ( f^L 

\Hn,N + Sa nj N 



-1/2 



< Cs, for all s E h,N- 



We remark that our choice of s\ ensures that si > r 2 with r = 1. 

Switching to the term \R' N {^) / R^{0\i from the definition, we have 



r' n (o _ ceo and c(o _ no vfW) 



Rn(0 2C(0 



C(0 2/(0 3I(v / 7) 



where I(yff) = y/J. Simple triangular inequality gives a direct bound as 



R'n(0 



Rn(0 



< 



no 



no 



+ 



177(0 



For the first term on the right hand side, simple manipulation gives us 



no 




f(0 





1 1 

+ 



Moreover, we could bound (£ — £+)[v / 7/-f(v / 7)] as 



< 



4«jV < Kjy 



(e - £+) vTlo < (g - £ + ) 3/2 (£ - g-) 1/2 < 



(e - £ + ) 3/2 



J(v7) 



dt 



< 



6 



2(1 - £_/£+) - e + - c_ - 4 



2n\ 

< T 1 + -^ 



A" J 



Hence, when N > Nq^), we obtain the bound for \fj /I{\fJ) as 



\/M < 3 A + W*^ < 3 / 2nA _^y_ 1 1/6 



0~ n ,N 
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This implies that \R' n (£)/Rn(£)\ is bounded by Ckn /a n ,N which further ensures 



r N 



<?n,N 
K N 



Rn(0 



< c. 



Finally, using (|T5j) and the fact that s\ is a fixed constant, we obtain that when N > Nq(j), 

Dl yN < Csexp(-3s/2) < Cs _4 exp(-s) < CA^ 2/3 exp(-s), for all s e I 2 , N - 
(ii) For D\ n , we first recall its definition as 



Dl N = r N R N (£) 



Ai'(4 /3 o^ 2 (e) + 



k n J d£ 

By definition of Rn and the large N bounds on r^y, 8^£2(kn, £) and Ai', we have 



D 



n,N 



<c^ 1 (e)N(4 /3 c)E- l (4 /3 c). 



The asymptotics of the phase function N suggest that 



i#(0N(4 /3 c) < cr^(o4 6 c 1/4 = cf l '\t)<T, 



1/6. 



,1/2 
n,N- 



For o"n,JV\//(0> we could simply bound it as 



^v7(0 = WgEpEM < ^. 

Observing that for s G h,N, &n,N we obtain 

R-'m^To < c^/ 1/4 (e) < < 

Once more, by (I75p and our choice of si [see lA.4j . we obtain 

Dl, N < Cs 2 exp(-3s/2) < Cs- 4 exp(-s) < C N~ 2 ' 3 exp(-s). 

This finally gives a bound of the form C N~ 2 / 3 exp(— s) for Tjvi on /2,7V- 

By a simple triangular inequality, we combine our bounds on T^y and T/v 2 on both /1 jy and 
-^2, at together and obtain that under the condition of Lemma[TJ when N > Nq(sq,^), 

\d s 9 n)N (x ntN (s))\ < C(s ) exp(-s), for all s > s . 

Bound for |^(s)|. We have pointed out that ip T is of the form -^pNd n -i,N-i( x n-i,N-i{ s )) 
with pn = 1 + O (iV _1 ). Hence, we have ip' T (s) as 

if/ T (s) = -j=p N d s 6 n -i iN -i(x n -i iN -i(s)), 

for which our bound on cr n ^d s O nt j^(s) apply directly and we obtain that under the condition of 
Lemma [U when N > Nq(sq,^), 



|<( S )| <C( S0 )exp(- 



for all s > sq. 
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Bound for \i/j' t (s) — G'(s)\. By the expression of ip' T , we could split \ip' T (s) — G'(s)\ as 



\i/)' T (s) - G'(s)\ <-j=\p N - 1| |(9 s 6' n _i ) jv-i(a: n _i ) ^_i(s))| 

+ —= \d s e n -i :N -i(x n - ltN -i(s)) - Ai'(s)| . 



(58) 



By our bound on \d s 9 n -i > N-i(x n -i > N-x(s))\ and recalling that = 1 + (iV _1 ), the first term 
is then bounded by C{sq)N~ 1 exp(— s). We focus on the quantity \d s 9 nt N(x nt jy(s)) — Ai'(s)| to 
bound the second term. 

We split the quantity of interest into two parts as the following: 



\ds6 nt N(Xn,N(s)) - Ai'(s)| < 



On,NF' n N {x n ,N(s)) ^ n ' N - Ai'(s) 
X n ,N{S) 

75v,i(s) + Tn,2(s)- 



+ 



&n,NFn,N(x n ,N(s)) 



<7V( S ) 



The Tn^s) term is exactly the same as T/v^s) defined in the previous study of 9 s ^ n ,/v(^n,jv(s)) 
and hence we quote the bound derived there directly as 

T N , 2 {s) < C{s )N~ 2/3 exp(-s), for all s > s . 

Switching to the Tn } i(s) term, we divide the whole region into the two disjoint intervals 
^i,jv = [so, siN 1 / 6 ) and i2,iv = [siiV 1 / 6 , oo) again. 

Case s £ Ix,n- Exploiting a similar strategy in splitting <J T i,NF' n N (x), on Ii t N, we decompose 

T NA (s) as T N)1 (s) = J2t=i v n at> witn V nN = D n N^n,N /x n , N (s) for i = l,2 and 4, 



V 3 



n,N — r N 



Xh,n(s) 



Ai'(4 /3 C) - Ai'(s) 



and V 5 



nJV 



x u ,n(s) 



Ai'(a). 



For i = l,2 and 4, using our previous bounds on D % n N and noting that \n n ,N /x n ,N{s)\ could 
be bounded by 2 on Ii n, we obtain directly that, when N > A^o(soj7), 

|^,jv| < C(s )N~ 2/3 exp(-s), for i = 1,2 and 4, and all s £ I 1)N . 

For 2? 3 N , by a first order Taylor expansion and the identity Ai"(s) = sAi(s) for all s, we have 
that, for some s* in the middle of an d s, 

Hn,N 



V: 



n,N\ = r N 



Xn,N{s) 



\s*Ai{s* 



2/3 1 
K N C, S 



< C N- 2/3 s 2 \s*Ai(s*)\ , 



where the inequality holds when N > Nq(so, 7) and comes from (|74p and the large N bounds for 
r N and p n ,N/x n ,N(s). 

2/3 

When s > 0, we know from the definition of £ that ti^ £ > and hence s* > 0. Moreover, 

(|74p implies that when N is large, an d hence s* will be greater than s/2. Thus, by (|5ip and 

the monotonicity of E, we obtain 

|s*Ai(s*)| < CsE _1 (s/2) < Csexp ^-^=s 3 / 2 ^ < C exp(-3s/2). 
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If so < 0, as before, we consider all s G [so,0]. Once again, we obtain from (|74p that for large 
N, K% 3 ( G [3s /2, 1] and hence s* G [3s /2, 1]. Then for all s G [* ,0], when AT > JVb(a ,7)» 

|s*Ai(s*)| exp(3s/2) < C(so) = max 

se[3so/2,l] 

This C(so) is continuous and non-increasing in sq. 

Thus, we could conclude that when iV > Nq^sq,^), for all s G Ii Ni 

\T>l >N \ < CN~ 2 / 3 s 2 \s*Ai(s*)\ < C(s )A^" 2/3 s 2 exp(-3s/2) < C(s )iV" 2/3 exp(-s). 

In f^^v, recalling a n ,N / Hn,N = O (_/V~ 2 / 3 ) and r^r = 1 + 0(A^ _1 ), we have that, when 
N > N (s ,~f), for all s G I^n, \s + li n ,N/vn,N\ > \{^n,N / (?n,N) and hence 



'"iV 



Xn,N(s) 



1 



1 



so + 



Mn,JV 



Cn,N 



+ |rjv - 1 
-1 



< r N 

< r^vls 

For Ai'(s), by (|51j) and (I53j) . we obtain directly that 

|Ai'(«)| < C(s )|s| 1/4 exp 
where C(sq) could be chosen as 



max |Ai'(s)| (1 + lsl 1 / 4 

sG[so,oo) 



rN\s\ 



-1 



+ Vn ~ 1| 



+ \r N - II < CiV- 2/3 |s| + CAT 1 . 



.2 s 3/2 



cxp 



2 s 3/2 



which is continuous and non-increasing. 

Putting two parts together, we obtain that for all s G ii jv, 

Kv| < C( S0 )A^- 2/3 +C7V- 1 / 3 ) Isl^exp (-§* 3/2 ) < C( So )iV- 2/3 exp(-s). 

We could then assemble all the bounds on T> l n N using the triangular inequality and conclude that 
under the condition of LemmadJ when N > Nq(sq,^), 



T n ,i(s) < C(s )N~ 2/3 exp(-s), for all s G I x , 



Case s G I2.N- In this case, we could act more heavy-handedly. In particular, by the asymp- 
totics of Tjv,i(s) on l2 t N and the asymptotics of Ai', we have 



r NA ( s ) < 



Vn,NF' n ^ N {x n . N (s)) 



X n ,N(s) 



+ |Ai'(s)| < CA^ 2 / 3 exp(-s) + Cs 1 / 4 exp(-3s/2) 



< C7A- 2 / 3 exp(-s) + CA^- 2 / 3 s 4+1 / 4 exp(-3s/2) < CN~ 2 ' 3 exp(-s). 

We then obtain the bound C(so)A -2 / 3 exp(— s) for 7jv,i(s) and hence also for \d s 9 nt N(x nt N(s)) — 
Ai'(s)| for all s G [sq, oo). Applying the bound to the second term in ([58]) . we obtain that under 
the condition of Lemma[H when N > Nq(sq,^), 

|<(s) - G'(s)\ < C(s )N~ 2 / 3 exp(-s), for all s G [s ,oo). 
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Improved bound for \ip T — G\. The above bound on \ip' T (s) — G'(s)\ could be used to derive a 
more stringent bound for \if) T (s) — G(s)\ as the following: 

r2s 



\Ms)-G(s)\ 



«(t) - G'{t)]dt - ty T (2s) - G(2s)} 



< / \ip' T (t)-G'(t)\dt+\iP T {2s)-G(2s) 

J s 



(59) 



< f C(s )N- 2 / 3 e- t dt + C(s )N- 2 / 3 exp(-s) < C(s )N- 2 / 3 exp(-s) 
This is exactly the bound that we have claimed in Lemma [TJ 



5.2.3 Bounds for quantities related to (f) T (s) 



In this part, we employ a trick that was first used in iJohnstond (|200ll . p. 320) to derive bounds 
for quantities related to 4> T from those for quantities related to tp T . 
Recall that <\> T could be expressed as 

A i \ 1 ~ z? I I \\ V>n-2,N 

V2 X n _ijv_i(s) 



where p~N = 1 + (A^ 1 ) [see IA.1I for its proof]. The problem of <p T is that the centering and 
scaling constants (/J, n -i,N-i, Cn-l,iV-l) in the transformation x r i_i ) jv'-l(s) does not agree with 
the "optimal" constants (// n -2,JV, o"n-2,iv) for the related function F n _2,N- To circumvent this 
problem, we introduce a new independent variable s' as the following [one should not confuse it 
with the s' previously appeared in Section dj: 



Pn~l,N-l + S ^n-1,N-1 ~ Pn-2,N + s'a n _2,N- 

Then s' = (p n ~i,N-i ~ Hu-2,n) / <?n-2,N + sa n -i, N - 1 /a n -2,N- By defining 

Pn-1,N-1 — Pn-2,N 



A 



N 



&n-2,N 



(60) 
(61) 



we have s' — s = An + [o" n _i ) ^_iO" n _ 2 N ]s and r (s) could be rewritten as 

, I \ 1-77 I t >\\ ^ri-2,N 

(Pr{s) = - 1 =p N F n _2,N{.Xn-2,N{.S )) —fr. 

V2 X n -2,N{S') 

Before we proceed, we list two important properties as the following [with proof given in lA.lj : 



A 



n = O (iV- 1 / 3 ) 



and 1 < 



OVt-l.TV-l 
&n-2,N 



1 + o (a^ 1 ) 



(62) 



Bounds for |</> T (s)| and |0' T (s)|. Applying our previous bounds for \6 n ,N(x n ,N(s))\ and 
\d s 9 n ^(x n ,N(s))\, and using (|62|) . we obtain that under the condition of Lemma [U for all s € 

[s ,oo) 

\4> T {s)\ = —/=PN \0 n -2,N{x n -2,N{s'))\ < C(s ) exp(-s') < C(s ) exp(-s); 

1 1 ds' 

|^(s)| = — PAT |5 s 6 , „_2,7v(rE n _ 2 ,Ar(s / ))| = ^=/5tv 1 9 s '6> rl _ 2 ,Ar(2; rl _2,7V (s')) | 



V2' 



r/.s 



< C(so) exp(— s') n — 1 < C(so) exp(— s). 
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Bounds for |0 r (s) - G(s) - A N G'(s)\ and \<f)' T (s) - G'(s) - A N G"(s)\. We consider |</> T (s) - 
G(s) — AnG'(s)\ in detail and the derivation for \(p' T (s) — G'(s) — A^G" (s)\ is essentially the same. 

By our definition of s' and recalling that Ai"(s) = sAi(s), we have the Taylor expansion of 
G(s') as 



G(s') = G(s) + (s> - s)G'(s) + i(s' - s) 2 G"(s*) 



G(s) + A N G'{s) + 



V2 



&n-2,N 



1 



with s* lies at somewhere between s and s'. 

Hence, by our bound on \ip T (s) — G(s)\, we obtain 

\</> T (s) - G{s) - A N G'{s)\ < C(s )N~ 2 / 3 exp(-s') + CN^ |sAi'(s)| + C{s'-s) 2 |a*Ai(a*)| . (63) 

On [so,oo), we have C(sq)N~ 2 / 3 exp(— s') < C(sq)N~ 2 ^ 3 exp(— s) for the first term in the above 
bound. Moreover, by (f5Tj) and ([53]) . the second term satisfies 



CN~ l |aAi'(a)| < C(s )A r ~ 1 |s| 1+1/4 exp (-^ 3/2 ) < C^A" 1 exp(-s), for all s > s . 
For the last term, we split [sq,oo) into U h,N as usual. For s E /i,jv> when A > Nq(sq,^), 



(s - s') 



'\2 



A N + I ■ lis 



< 



CW-i/3 + CN^s < ( CN- 2 / 3 ) A 1 



&n-2,N 

We obtain from the above bound that \s* — s\ < 1 and hence by (|5ip and (|53p . 

C(s-s') 2 |s*Ai(s*)| < CA- 2/3 (|s| + l)E~ 1 (s- 1) < C(s )iV~ 2/3 exp(-s), 

where C(sq) could be chosen as max sg [ S0jOO ) Ce'E-^s- 1). 

On /2,iV) we have s' > s/2 from (|77p and hence s* > s/2. By (|5ip and (|53p . we obtain that 
when A > Ao(so,7), 

C(s' - s) 2 |s*Ai(s*)| < Cs 3 exp L s 3 / 2 ^ < CV 4 exp(-s) < CN~ 2 / 3 exp(-s). 

Therefore, we have shown that, for all iV > Ao(so,7) an d s > sq> the right hand side 
of (|63p is further controlled by C(so)A -2 / 3 exp(— s), which is exactly the desired bound for 
\Ms)-G(s)-A N G'(s)\. 



A Technical Details 

A.l Properties of (3 N ,p N ,p N ,A N and a n -. 1)N -i/ '(T n -2,N 
A. 1.1 Property of /3jy 

We are to show that 

0N = ^= + 0(i\T 1 ). 
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First of all, we recall that (f> T (s) is defined to be when r(s) = p n ,N + sa ni N < 0, i.e., when 
s £ (— oo, — jin,N / &n,N) ■ Hence, we have 

0N = n / 4>r{s)ds = 2 4>{x;a N - l)dx 

J - — oo «/ 



iY 1 /4( n _ 1 )l/4pl/2( iV + 1 ) 



2 v / 2r 1 /2( n ) ,y 

2 -( QN -l)/2 N l/4( n _ l)V4 r l/2 (re)r ( jV+3) 

~ (at + i)rV2(Ar + i)r 

Applying Sterling's formula 



x a N /2-l e -x/2 L a N -l 



[ - 

Z 



we obtain that 



r 1V2 1/4, M/4 r*rV /4 M B/2 ^ V^^V^^ 

P7V ^y /4 teV iV+1>/a rjs-V /a r2iiV" +1>/a 

l^Af+i y V e / V n+1 / \ 2e J 

i / i \ ™/2 / „ x (Ar+l)/2+3/4 - 

= ^('-^i) + (i + o (J v-)) = -L(i + o (*-.)). 

The last equality is exactly the asymptotics that we need for /3jv- 
A. 1.2 Asymptotics of pjy and pjy 

In this part, we show that the asymptotics of pjy and p~N satisfy 

p N ,~PN = l + 0(N~ l ) . 
We consider p^ first. By definition, we have 

PN = = = • 

Pn,N fJ"a-l,N-l 

Plugging in the definition of (T n -i,jv-i an d Mn-i,JV-ij we obtain that 

1/2 



= N^(n - ( J N _ i + J n _ i ) ^ | __L= + 



1 1 



^-5 V n 2. 



\ 1/4 / \ 1/4 

N \ I n-1 \ 



l + ofiv^ 1 ; 
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For pat, we have from its definition that 

PN = = — 

Pn-2,N &n~2,N Pn-2,N 

/ \ V 2 



l + 0(iV 



— 1> 



The last equality holds since a n -i : N-i/&n-2,N = 1 + (TV x ) as claimed in ([6T]) . which is to be 
shown below in lA,1.3l 



A. 1.3 Properties of A^r and c n _i i jv_i/o'n-2 jv 
We focus on A^r first. As a reminder, we recall its definition as 

Vn-l.N-l — fJ"n-2,N 



&n-2,N 



Bv lEl Karouil (|2006bl . A. 1.2), we have for the numerator that fj, n _i ; N-i — p n -2,N = O (1). For the 
denominator, if we let denote (re — I) / (iV + |) by 7^^, we then have 



&n-2,N 




'1/3 



1 + y/l^N 



O (iV" 1 / 3 ) . 



The last equality holds since 7?!,^ is bounded below for all n > N . Combining the two estimates, 
we establish that 

A at = O (AT 1 / 3 ) . 

We now switch to prove that 



1 < (T n -l,N-l/<r n -2,N = 1 + (A^ 1 ) 



The fact that (T n -i^-\/^n-2,N = 1 + O [N x ) has bee n proved inlEl Karouil (|2006bl . A. 1.3). On 
the other hand, we have from the second last display of lEl Karouil (|2006bl . A. 1.3) that 



&n-2,N 



n + N y J 



1 + 



- - — 

2 ln + iV 



+ 0(n 



Both terms become greater than 1 when N > Nq{^) and hence a n -i^-ia~^_ 2 at — 1 f° r large N. 
Actually, the inequality holds for any n > N > 2. However, what we have proved here is sufficient 
for our argument in Section [5.2.31 
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A. 2 Evaluation of the entries of K T 

In this part, we work out the explicit expressions for the entries of K T given in (|20p . To this end, 
we proceed term by term. 

K T ^i term. For K T n, we have from its definition that 

= * n ,N [S Nj2 (r(s),r(t)) + ^(r( S ))(e0)(r(t))] 
= 5 T (s,t)+<7 n ,jv^(r(a))(e0)(r(t)). 

For the second term in the last expression, we have cr n< N^)(T(s)) = tp T ( s ) an d 



<p(z)dz = / 4>(t(u))t' (u)du = @n,N / (j)(T(u))du = / 4> T {u)du. 

y Jt Jt Jt 

Hence, the second term equals ip T (s)(e(j) T )(t) and we obtain 

Xr,u(a,t) = S T (s,t) + Vr(s)(£0r)(*)- 

-f^T,i2 term. We first recall the definition of 1^,12 as 



K rA2 (s,t) = -a ntN y/T'(s)T'(t)d 2 S N7 i(T(s),T(t)). 
For the involved partial derivative, we have 

C2^Ar,i(r(sj,r(tjj = — -r- — — ===== = — = = — d t b T (s,t), 

with Si* defined as in (fl9j) . Observing that r'(s) = r'(t) = cj n jv> we obtain 

tf T ,l2(M) = 
K T 2\ term. By its definition, we have 

Kr,2i(M) = [^iWj),^)) - e(t(s) - r(t))] • 

&n,N 

Observing that r is a monotone transformation, we obtain 

e(r(s)-r(t)) = e( S -t). 
For the quantity eS^iC^s), r(t)), by using the above identity, we have 

eS N)1 (T(s),T(t)) = J S NA (T(u),T(t))e(T(s)-T(u))T'(u)du = J S?(u,t)e(s-u)du = (eS?)(s,t). 

Plugging all these identities back into the definition of K T)2 \, we obtain the expression 

K T , 2 i(s,t) = {eS?)(s,t)-e(s-t). 

Kt,22 term. The formula for K Tj22 is obtained directly from that of if T) n by switch s and t. 
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A.3 Behavior of R N (£), R' N (£) and k%*C 

In this part, we investigate the behavior of Rn(£), R' n (£) and rej^C which is essential in deriving 
the Laguerre asymptotics. Before we start, we remark that throughout our discussion, we consider 
only the case where s E i^jv = [so, siiV 1 / 6 ). 

A.3.1 Properties of R N (£) and R' N (£) 

Recall the definition R N (Q = (C(£)/C(£+)) 1/2 > w e obtain that 

Rn^+) = 1, and i^(0 = -i& /2 C(0" 3/2 C(e)- (64) 

By our derivation in the Liouville-Green approximation, we know that £ = + str^ jv/^jv 
and as has been shown before, when TV > Nq^), 



AT2/3 

As iV — ► oo, we have 



f ^) < 4 (1 + 1/^7) 1/3 (1 + V7) (1 + 7)- 1 < C. 



sup = 0(siiV~ 1/2 ) -+0. (65) 



We then have the following first order Taylor expansion 

Rn(0 = Rn(H + ) + R' N (n(Z - £+), for some E [£ A V £+]. 
Hence, when iV > iVo( s o>7)i we have 

|i?iv(£) - 1| < \R' N (0\ 4 (1 + 1/^7) 1/3 (1 + V7) (1 + 7) _1 ^" 2/3 |a|- (66) 

In order to bound |i?jv(£) — 1| uniformly on I\ jv and also of its own interest, we are to derive 
a bound for \R' N {£,)\ by some constant that does not depend on N and is uniform for s G I\ t N- 
By the definition of R' N (£) in 1(631) . this relies on the understanding of the quantities Ov, and 

C(0- 

First, we consider the asymptotics of Cat- Using the notation = m ± 1/2, we obtain from 
simple calculation that as N — > oo, 

. nf< 6 (n + +iV + ) 1 / 3 7 1/6 (1+7) 1/3 

*iV = — . , ,o ► — (67j 



2i/3(^H + ^AT7;) 4 / 3 2V3(i + ^) 



4/3 " 



Second, we check the behavior of C(£)- For simplicity, we let £± = limAr^oo^-i- and simple 
manipulation gives us 

^° = 2(l + V7) 2 /(l+7), and £f - £° = ty7/(l +7)- 
We assume first that sq > 0. By the definition of for £ > £+, we recognize it as 

C(0 



" 1/3 V(e-^+)(e-£- 



24 
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When s G I\ n with sq > 0, we always have the bounds 



1 < 



1/2 



i< 



1/2 t 



(68) 



Plugging these bounds into our modification of C(£)> we obtain the lower and upper bounds for 
C(£) as 



d /3 ^-e-) 



1/3 



< C(0 < 



(e-e- 



a/2 



2 2/3^ " MV - 2 2 / 3 £ 2 / 3 (£+ - £- ) 1 / 6 ' 

where as iV — > oo, both bounds converge to the same limit: 

2 J/3 ( ^)2/3 = 1 }™J N - 

We remark that because of (|65[) . the convergence is uniform on i^jv, which is crucial for deriving 
finite iV bounds from the limit. 

If so < 0, we only need to consider the case where s E [so,0], for the case where s > has 
essentially been considered in the above derivation. When s E [so,0], the definition of is 
changed to 



ao 



+ y/(S+-z)(z-Z-) 



-1/3 



2.; 



dz 



y/(Z+-0(S-Z- 



2£ 



1/2 



A+ - e- 

In this case, we have for all s £ [so,0], 



2£ 



-1/3 



2£ 



< 



1/2 



< 1. 



(69) 



We notice that all the bounds tend to 1 when N — ► oo. Hence, plugging these bounds to our 
modification of we obtain the lower and upper bounds for it that tend to the same limit as 
when sq > 0. Thus, we conclude for £(£) that when TV > iVo(so,7), 



C\ lim Ctv < C(0 < C*2 lim (n, for all s S Ii jv. 



(70) 



Such a derivation is valid, since the convergence to the limit is uniform for s G /1,7V. 

Finally, we study the behavior of £(£)• To this end, we first derive a convenient representation 
for it. By the definition of £, we have (() 2 = /C _1 - We then take derivative with respect to £ on 
both sides and collect to get 

f'C-fC 



c 



2CC 2 



Furthermore, we plug in £ = //C 2 and obtain the final representation as 

;■• A C 1 
^ 2/ • 
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Noticing the definition of /, we could regard the above representation as the product of three 
factors: C(0> (/'(£) ~ C(£) 3 )/(£ - £+) and 2£ 2 /(£ - £-)■ The first factor C has already been 
studied. We first investigate the second factor: (/'(£) — C(£) 3 )/(£ — £+)• 

As before, we start with the assumption that so > 0. By the definition of /, we have 



4 £2 



4^2 



2^ 



For /'(C) - C(0 3 > w e consider first the quantity 1(C) = (£ - £-)/4£ 2 - C(£) 3 - B Y USED and 
straightforward calculation, we obtain 



1/2' 



4£ 



<2T(0<(1-^ 



4^2 



Hence, we obtain that when ./V > iVo(so>7)i for all s £ I\ 



N 



< 



and hence 



f'(0 - m : 



< 



C 



1 2(£-£_) 9 
< h — — z—L < — 

- 4? 2 I" £ 3 4^2 - (qc)2 



2? 



< 



Moreover, when TV > Nq(sq,^), we could also have 

C(0|<(^-r) 1/3 (£?r 2/3 , and 
Multiplying the three bounds, we finally obtain that when N > Nq(sq,^), 

C(0 < C(^)" 2/3 (C - )" 2/3 = C 7 - 1 /3(i + V7)" 4/3 (l + 7) 4/3 , for all s € I ljN . 



£00 _ £OC 



(71) 



We remark that when so < 0, we just focus on s £ [so,0]. In this case, the quantity Z(£) 
becomes 



4^2 



2e 



3 f i+ ( z-j. 

2y s U+-e- 



1/2 



jz 2£ 



-i -l 



with (|69p holds. Everything else follows just as in the study of C(£)- I 11 particular, (|7ip still holds. 

Finally, by the definition of R' N (£) in ([M]) and our analysis of Cat, C(0 an d C(£)j we have that 
when N > iVo(so,7), 

TV- (72) 

(73) 



i?Ar(0| < C 7 - 1/2 (l + 7), for all a G /i 



for all s €. Ii 



This bound, together with ()66[) . gives 

|i?jv(e) - 1| < CV7(1 + V7)(l + 1/^7) 1/3 A^ 2/3 * < CW~ 2/3 |^ 

A. 3. 2 Behavior of k% 3 ( 

Exploiting a simple Taylor expansion at to the second order, we obtain that 

= «7V 3 C(C+) + ^ 1/3 Cr niAr CArS + ^K^ 4/3 CT 2 )Ar C(r)s 2 . 
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— 1/3 ' 

Recalling that C(£+) = an d that (T n ,N K N Cn = 1, we obtain that 



1<V/vC(D 2 



4 /3 C(6 - - 

2 KJV CtV 



-s . 



According to our previous discussion, we have 



^L = ( N ~VA and ^0 = 0(1), for all * 6 I 1>N . 
v ' Cn 



Hence for all s G /i,jv> when TV > No(so,"f), we have 



4 /3 c(0 - - 



< CN~ 2 / 3 s 2 . 



Note that on Ii at, |s| < siiV 1 / 6 and hence we could modify the above bound to be 



4 /3 C(6 - s 



< (cN- 2/3 s 2 ) Ay M, for all s £ I hN . 



(74) 



A. 4 Choice of s\ and its consequences 

The key point in our choice of s\ is to ensure that when s > s\, we have 

1*nC 3/2 > \s. (75) 

To this end, recall that in I Johnstone! ( 200ll . A. 8), one could choose §1(7) = C{pf)(l + 5) with 
some 5 > 0, such that when s > §1(7), we have /(£) > 2ja n ^ and hence if s > 4^1(7), 



KN( 3/2 = KN \/f(z)dz > K N (s-Sl(7)) 



5+ 



cr n ,N 



= 2(s-si( 7 )) > -s. 



Moreover, by the analysis in |E1 Karouil (|2006bl . A. 6. 4), £1(7) could be chosen independently of 7 
and hence we could define our s\ to be 

si = 4si 

which is independent of 7 and such that (|75p holds. Moreover, for our convenience of arguments, 
we could also impose the constraint that s\ > 1. 

After specifying our choice of s±, we spell out two of its consequences. The first of them is 
that when s > s± > 1, 

E_1 (4 3 < Cexp(-3s/2) < Cexp(-s). (76) 
This is from the observation that E(x) > Cexp(|x 3//2 ) and hence 

E~ 1 (4 /3 C) < Cexp (-1 KN (V^ < Cexp(-3 S /2). 

The other consequence is about the behavior of s' defined in (I60p when s > s±. Remembering 
that s% > 1, we then have that when s > s\ and N > Nq(j), 



s'-- = A N + 



! ) s > A N + — >Aat + ->0. 

<7n-2,7V 2/ 2 2 



(77) 



The last inequality holds when N > N (j) for Ayv = O (N^ 1 / 3 ). 
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A. 5 Proofs of Proposition Q] and Lemma [4] 
A. 5.1 Proof of Proposition Q] 

By the definition (|16p of the determinant for operators in class A, we have a first decomposition 



as 



|det(J - A) - det(J - B)\ = |det 2 (7 - A) exp (-trA) - det 2 (7 - B) exp (-trS)| 

< |det 2 (7 - A) - det 2 (7 - B)\ [|exp (-trA) - exp {-tvB)\ + exp (-tr5)] 
+ |det 2 (I - B)\ |exp (-trA) - exp (-trJ3)| . 

According to Gohberg et al. ( 2000l . p. 69, Theorem 7.4), we have the bound for the 2-determinant 



MS 



|det 2 (I - A) -det 2 (I -B)\ < \\A-B\\ 2 exp 
Moreover, for any A, B 6 A, the Hilbert-Schmidt norm satisfies 



^(1 + \\A\\ 2 + \\B\\ 2 ) 2 



\\A — B\\ 2 < W-Aii ~ Ba\\ 2 + ^2 W-^ij — Bij\\ 2 < W-Aii — Ba 111 + ^ \\Aij — Bij\\ 2 . 
i i i^j 

Recalling that for any trace class operator A, tr(A) < ||A||i, we obtain 

exp (-trfl) < exp |trfl| < exp (|tr£ u | + |trB 22 |) < exp (||-Bn||i + ||£ 22 ||i) . 

Observing that for |x| < 1/2, \e x — 1| < 2\x\, we obtain that, when Ei \\Au — Ba\\i + 
Ei^j \\A-ij - Bij\\ 2 < 1/2, 



| exp 



(-trA) - exp(-tr£)| < 2exp(-tiB) |trA- trJ3| < 2^ \\A U - ^ii || le 11 1511 11 1+11 jB2211 1 



<2 [ J^IIAii-Stflli+J^II^- -Bij\\2 exp(||J3u||i + ||fl 2 2||i)- 
Plugging all these bounds into our first decomposition, we obtain an intermediate bound as 



M(A,B) (Zi \\An - + £ ¥i Uij - B 
M(A, B) = 2 |det(J-S)| +2exp 



ij\\2 I, 



where 



Ul + \\A\\ 2 + \\B\\ 2 ) 2 + J2\\Bn\\i 



Under the given condition, 

1 + \\A\\ 2 + \\B\\ 2 < 1 + 2||B|| 2 + \\A - B\\ 2 



< 1 + 2||J3|| 2 + " B «Hi + Yl H A v - B vh < 2 + 2 II B I 



which reduce M{A, B) to the constant M{B) claimed. 
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A. 5. 2 Proof of Lemma H 

By definition, we have 



K N 



4& 



a 



Thus, we obtain from direct calculation that 



2( 



2^4 



&n,N 
K N 



K N 




°l,N 





B Logarithmic Transformation and the Smallest Eigenvalue 

In this part, we give a brief account of how one could derive the similar second order accuracy 
results claimed in © and ([7J) with a logarithmic tr ansformation. In m any aspects, the derivation 
here for Laguerre orthogo nal ensembles [as based on lAdler et al. I <|200CL Proposition 4.2)] is parallel 
to what iJohnstond (120071 ) did for Jacobi orthogonal ensembles. 



B.l Logarithmic transformation for the largest eigenvalue 

For the largest eigenvalue, we assume the same setting as that in the beginning of Section | 
With (j> k defined in fl]3j), let 

4>k(x;a) = {-\y(t) k {x;a)/yfx. (78) 



Then setting a at = N(N + on — 1), we have the following alternative way of expressing £V,i 
in term of S kj 2i the correlation kernel occurring in LUE(fc, a) model: 



\/^Sn-i,2(x, y; a N ) + J — — 4> N ^i{x; a N )(e(p N ^ 2 )(y; «jv)- (79) 



As a comparison, the central formula (|15p could be rewritten as 

SN,i(x,y;a N - 1) = S N ^ 2 (x,y;a N - 1) + (p N -i(x; a N )(s<f> N )(y; a N - 2). 



The equivalence of the above two representations is given in lAdler et al. Appendix) and 

hence omitted here. 

We make use of the representation (|T9|) to give an alternative second order accuracy argument 
with a logarithmic transformation. Recalling a at = n — N, we define 



k+l + \/k + (XN + l 



2 / ' 




Then we let 



k + \ \/k + aN + \ 



fe (x) = (-l) fc ^ '- K —j= '- — x x l 1 4> k (x;a N ). 

For Sn-i,2(u, v; a^) = Sjv-i,2(e", e v ] ajv) e u / 2 e v / 2 , we could represent it as 

/•oo 

S N -x, 2 (u,v;a N ) = / [^(e^)^^) + ^ N ^{e u + w )4> N „ 2 {e v+w )] dw. 
Jo 
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We then define 

v n ,N = log/ijv-i, r n ,N = <TN-l/VN-l, and r(s) = ex.p(v n ,N + STu^n) ■ 
The r-transformation induces the following transformed Laguerre polynomials: 

tpr(s) = T ni Ar^Ar_i(T(s)), (j) T (s) = T n , N (j) N - 2 (T{s)). 



Define S T (s,t) = \/t' (s)t' (t) Sn-i, 2 (t(s), r(t); ajy), we have the following integral represen- 
tation from the expression for the Sjv-1,2 kernel: 

/■oo 

St(s, t)= [<t>r(s + z)i> T (t + z) + <f> T (t + z)ip T {s + z)]dz. (80) 
J 

Moreover, if we define the following quantities [fix sq £ M, with s,t > sq] 

Qn{s) = y/T'{s )/T'(s), and S?{s,t) = S T (s,t) + t/v(s)(e0 T )(t), 

we have 



ifyi(a') = P(xi < r(s')) = P(QogX! - y n ,N)lr n , N < s') = y/tet{I-K T ), 

where the new operator K T has a 2 x 2 matrix kernel with entries given by 

K r ,u{s,t) = q N (s)qJ l 1 (t)S^(s : t); K T>12 (s,t) = -q N (s)q N (t)d t S?(s,t); 
K T;21 (s,t) = q^ 1 (s)q- 1 (t)[e 1 S^(s,t) - s(s - t)]; K Tj22 (s,t) = K Tjll (t, s). 



(81) 



By Proposition [TJ we need to obtain entrywise bound for K T — Kqoe here. To this end, a 
convenient representation of the kernel difference as in Section 13.41 is most helpful. 
For the transformed Laguerre polynomials <j) T and ip T , we have 



oo 

ip T = 0, and 

-oo 



(N - l)V 4 (n - l^W^n - 1)Y (^) 

2 a *- 2 {N- i)rV2(iv- i)r(§) 



For notational convenience, let /3jv = | <j> T = + O (iV x ) . 

With the replacement of e by e in (|24p and the matrices L, Li and L2 introduced in Section 
3.41 we obtain that 



K T = Q N (s) [K? + + < 2 + iT] Q^(t). 
with the unspecified components given by 

= Z [S T ~ Vr ® £0r] , ^ = [Vv (a)] , < 2 = #^2 [VV (*)] , 

where Qn{s) = diag(gjv(s), qj^(s)) and as before G(s) = Ai(s)/\/2- 
For 

log/iAT_i - log/lAT_ 2 

A^V = / , 

&N-2/ 

set Gat = G + AjvG', we have 

K?-K R = L [S t - S A - i) T ® ecp T + G ® e(G7v - AjvG')] 

= X [5 r - 5^ + AjvG ® G] - £ [</v (8) e^ r - G g> eGV] 
= ^ + tf. 
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If we write Sa n = Go Gn + Gn o G, we have S^ 1 = L[S T — Sa n ]- 
Finally, we organize K T — Kqqe as 

K T - K GOE = 5 R > D + 5 R J + 5% + S[ + 5% + 6 E , 

where the unspecified terms are defined as the following: 

5 R ' D = Q N {s)K?Q- N \t)-K?, 
Sf = Q N {s)K^Q-\t) - Kf, for i = 1,2, and 
8 £ = Q N (s)K £ Q- 1 (t)-K £ . 

With the ab ove representation of the kernel difference, we could apply the machineries in 



Johnstonel (j20Q7h to obtain the desired second order accuracy of the Tracy- Widom approximation 



to the distribution of (logxi — v n ,N)/T n ,N- After establishing the result in RMT notation, we 
replace N by p and hence obtain the bound in (|5|). 

B.2 The smallest eigenvalue 

We first restate the claim in (J7|) in a more friendly way. Let v~ N and r~ N be the centering 
and scaling constants defined in ([6]), with p replaced by TV. Then for xjy the smallest eigenvalue 
in the model (|8]) [with a = — 1], there exists a continuous and nonincreasing function C(-), 
such that for all real so, there is an integer A^o(sO)7) f° r which we have that for any s > sq and 
N> iV (s ,7), 

P{log.XAT > u~ N - st~ n } - Fi(s)| < C( So )iV~ 2/3 exp(- S /2). (82) 

Fix xo > and consider any x' G [0, xo\. To prove ([82]) . we first observe that for xn in model 
), when N is even, choosing % = Io<x<x', we have 



P{x N > x'} = yjdet{l - K NX ), 



where Km is the same operator as for x\, which has the kernel (lllj) . If we think of Kn as Hilbert- 
Schmidt operator on L 2 ([0,x']; p) © L 2 ([0,x']; p" 1 ) with p any weight function chosen from some 
proper class, then the above formula changes to 



P{x N > x'} = y/det(I-K N ). 

Introduce the transformation 

t(s) = ex.p{u' N - st' n ), 

and let so = r~ 1 (xo) and sq < s' = t~ 1 (x'), we have r _1 ([0,x']) = [s',oo). By defining 
4> T = — t~ n <Pn-i(t(s)) and ^ T = —t~ n 4>n_2{t(s)) and using the alternative representation 
(I79|) . the formal derivation for the largest eigenvalue in IB. II could be carried out analogously for 
the smallest eigenvalue. In particular, we have the integral representation (|80p for S T (s,t) = 
v /r / (s)r / (t)5Ar_i i2 (r(s), r(i); a N ) and 

P(x N > t(s')) = P((logx N - u- N )/r- N > -s') = Vdet^-i^), 

with K T thought of as Hilbert-Schmidt operator on L 2 ([s' , oo); p o r) © L 2 ([s', oo); p~ l o r) with 
entries given by (|8ip . We remark that the actual definition of (j) T and ip T used in these formulas 
have changed, albeit the formal representations remain the same. 
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The rest of the proof for the smallest eigenvalue becomes the routine procedure of a) finding 
a representation for the kernel difference K T — Kqoe and b) studying the asymptotic behavior 
of the transformed Laguerre polynomials (f> T and tp T . The former is very similar to the largest 
eigenvalue case while the latter could be obtained by applying the Liouville-Green approach to 
analyze the behavior of the solution to the differential equation (|48p around the lower turning 
point 
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